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Priority date (day/month/year) 
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Applicant's or agent's file reference 
P04646PC00/HA 


FOR FURTHER ACTION ^ ee Notification of Transmittal of International 

Preliminary Examination Report (Form PCT/IPEA/4 1 6) 


International application No. 
PCT/SE00/01079 


International filing date (day/ month/year) 

26.05.2000 


Priority date (dqy/month/year) 
28.05.1999 


International Patent Classification (IPC) or national classification and IPC7 
C 07 K 14/225, A 61 K 39/12 


Applicant 

ACTIVE BIOTECH AB et al 



1 . This international preliminary examination report has been prepared by this International Preliminary Examining 
Authority and is transmitted to the applicant according to Article 36. 



2. This REPORT consists of a total of 



sheets, including this cover sheet. 



^><] re P° rt is ^so accompanied by ANNEXES, i.e., sheets of the description, claims and/or drawings which have 

been amended and are the basis for this report and/or sheets containing rectifications made before this Authority 
(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 



These annexes consist of a total of 



sheets. 



3. This report contains indications relating to the following items: 
Basis of the report 
Priority 

Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 



Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 

Certain documents cited 



I 


El 


II 


□ 


III 




IV 


□ 


V 




VI 


□ 


VII 


□ 


VIII 


□ 



Date of submission of the demand 
20.12.2000 


Date of completion of this report 
24.09.2001 


Name and mailing address of the IPEA/SE 
Patent- och registreringsverket Telex 
Box 5055 17978 
S-102 42 STOCKHOLM PATOREG-S 

Facsimile No. 08-667 72 88 " 


Authorized officer 

Yvonne Siosteen/EO 
Telephone No. 08-782 25 00 




INTERNATIONAL PRE 



ARY EXAMINATION REPORT 



ational application No. 

PCT/SE00/01079 



I. Basis of the report 



I. With regard to the elements of the international application:* 
| | the international application as originally filed 



the description: 

P a s es 1-3 and 5-13, 

pages 

pages 4 



67 



, as originally filed 



, filed with the demand 



, filed with the letter of 05 . 09 . 2001 



the claims: 

pages 

pages 

pages 

pages 15 and 16 



, as originally filed 

, as amended (together with any statement) under article 19 

. filed with the demand 



^ the drawings: 
pages 1-4 

pages 

pages 



, filed with the letter of Q5 . Q9 . 2001 



, as originally filed 



. filed with the demand 



, filed with the letter of 



^ the sequence listing part of the description: 

pages 14-66 

pages 

pages 



, as originally filed 



. filed with the demand 



, filed with the letter of 



2. With regard to the language, all the elements marked above were available or furnished to this Authority in the language in which 
the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language which is: 

| | the language of a translation furnished for the purposes of international search (under Rule 23. 1(b)). 
| | the language of publication of the international application (under Rule 48.3(b)). 

I I the language of the translation furnished for the purposes of international preliminary examination (under Rules 55.2 and/ 
1 — 1 or55.3>. 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the international 
preliminary examination was carried out on the basis of the sequence listing: 

| | contained in the international application in written form. 

| | filed together with the international application in computer readable form. 

| | furnished subsequently to this Authority in written form. 

[ | furnished subsequently to this Authority in computer readable form. 

I I The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the 
international application as filed has been furnished. 

□ The statement that the information recorded in computer readable form is identical to the written sequence listing has 
been furnished. 

4. | | The amendments have resulted in the cancellation of: 

1 1 the description, pages 

1 1 the claims, Nos. 

| 1 the drawings, sheet/fig 

^ 1 | This report has been established as if (some of) the amendments had not been made, since they have been considered to go 
• 1 — I beyond the disclosure as filed, as indicated in the Supplemental Box (Rule 70.2 (c)).** 

* Replacement sheets which have been furnished to the receiving Office in response to an invitation under Article 14 are referred to 
in this report as "originally filed" and are annexed to this report since they do not contain amendments (Rules 70. 16 
and 70.17). 

** Any replacement sheet containing such amendments must be referred to under item I and annexed to this report. 
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fional application No. 
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III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 

I . The questions whether the claimed invention appears to be novel, to involve an inventive step (to be non obvious), or to be 
industrially applicable have not been examined in respect of: 

| | the entire international application, 
claims Nos. 10 and 11 



because: 

the said international application, or the said claims Nos. 1 0 and 1 1 



relate to the following subject matter which does not require an international preliminary examination (specify): 

See PCT Rule 67.1, (iv) . : Methods for treatment of the human or 
animal body by surgery or therapy, as well as diagnostic 
methods . 



| | the description, claims or drawings (indicate particular elements below) or said claims Nos. 
are so unclear that no meaningful opinion could be formed (specify ): 



[ | the claims, or said claims Nos. are so inadequately supported 

by the description that no meaningful opinion could be formed. 

[ | no international search report has been established for said claims Nos. 



2. A meaningful international preliminary examination cannot be carried out due to the failure of the nucleotide and/or amino acid 
sequence listing to comply with the standard provided for in Annex C of the Administrative Instructions: 



| | the written form has not been furnished or does not comply with the standard. 

| | the computer readable form has not been furnished or does not comply with the standard. 
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V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 



] . Statement 

Novelty (N) 

Inventive step (IS) 



Claims 
Claims 



1-9, 12-13 



Claims 1-9, 12-13 
Claims 



YES 
NO 

YES 
NO 



Industrial applicability (I A) Claims 1-9 , 12-13 

Claims 



YES 
NO 



Citations and explanations (Rule 70.7) 

The claimed invention relates to peptides encoded by 
nucleotide sequences from Salmonella for use in medicine. The 
sequences encode two fimbrial structures of Salmonella 
enterica subspecies I and Salmonella enterica subspecies I 
serovar Typhi. It has been found that SEQ ID NO 1 is specific 
to all Salmonella enterica subspecies I. The peptides and the 
nucleotide sequences encoding them are very suitable for use 
in diagnosis and in vaccine preparations. 

It is already known from Folkesson et al . , Abstracts of the 
General Meeting of the American Society for Microbiology 97, 
page 219 (1997) that the fimbrial genes safA, safB and safe 
from Salmonella have been identified. 

The genes safA, safB and safe are encoded by parts of the 
nucleotide sequence SEQ ID NO 1 of Sequence listing No.l of 
the present application. 

The sequences have, however, not been disclosed in the prior 
art. Neither have the unique specificities of the genes been 
known before. Because of their specificity they are very 
useful in vaccine preparations. 

Therefore the claims are considered to be novel and to involve 
an inventive step. 
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INTERNATION 



ARCH REPORT 



Interna^ronz 



Internarohal application No. 

PCT/SE 00/01079 



A. CLASSIFICATION OF SUBJECT MATTER 

IPC7: C07K 14/225, A61K 39/112 

According to International Patent Classification (IPC) or to both national classificatio n and IPC 

B. FIELDS SEARCHED — 
Minimum documentation searched (classification system followed by classification symbols) 

IPC7: C07K, A61K 

Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 

SE,DK,FI,N0 classes as above 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category * 



Citation of document, with indication, where appropriate* of the relevant passages 



Relevant to claim No. 



Abstracts of the 97th General Meeting of the 

American Society for Microbiology, May 4-8,1997, 
Miami Beach, Florida, A. F0LKESS0N et al: D-67. 
"Cloning and Characterization of Genes Encoding a 
Novel Salmonella spp. Adhesive Structure". See 
abstract . 



Molecular Microbiology, Volume 33, No 3, 1999, 

Anders Folkesson et al., "j Multiple insertions of 
fimbrial operons correlate with the evolution of 
Salmonella serovars responsible for human disease", 
page 612 - page 622, see entire document 



1-4,6,8,10, 
13-14 



1-14 



| Further documents are listed in the continuation of Box C. | | See patent family annex. 



E" 



Special categories of cited documents T* 

document defining the general state of the art which is not considered 
to be of particular relevance 

erher document but published on or after the international filing date "X* 

document which may throw doubts on priority claim(s) or which is 

cited to establish the publication date of another citation or other 

■special reason (as specified) »y~ 

document referring to an oral disclosure, use, exhibition or other 
means 

document published prior to the international filing date but later than 
the pnority date claimed 



later document published after the international filing date or priority 
date and not in conflict with the application but cited to understand 
the principle or theory underlying the invention 

document of particular relevance: the claimed invention cannot be 
considered novel or cannot be considered to involve an inventive 
step when the document is taken alone 

document of particular relevance: the claimed invention cannot be 
considered to involve an inventive step when the document is 
combined with one or more other such documents, such combination 
being obvious to a person skilled in the art 

document member of the same patent family 



Dale of the actual completion of the international search 



2 October 2000 



Date of mailing of the international search report 

2 3 -10- 2000 



Name and mailing address of the ISA/ 
Swedish Patent Office 
Box 5055, S-102 42 STOCKHOLM 
Facsimile No. + 46 8 666 02 86 



Authorized officer 



Henrik Nilsson/GH 

Telephone No. +46 8 782 25 00 



Form PCT/ISA.210 (second sheet) (July 1992) 
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SEARCH REPORT 
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1 application No. 
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Box I 



Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This international search report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 



l - £3 Claims Nos.: 10-11 

because they relate to subject matter not required to be searched by this Authority, namely: 

see next sheet * 



1 □ 



Claims Nos.: 

because they relate to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 



3. I | Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 
Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 



1 . As all required additional search fees were timely paid by the applicant, this international search report covers all 
searchable claims. 

2. j~| As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 

of any additional fee. 

3. r~] As only some of the required additional search fees were timely paid by the applicant, this international search report 

covers only those claims for which fees were paid, specifically claims Nos. : 



4. Q No required additional search fees were timely paid by the applicant. Consequently, this international search report is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest 



|~~| The additional search fees were accompanied by the applicant's protest. 
| [ No protest accompanied the payment of additional search fees. 
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CH REPORT 
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InteniaiSBffapplication No. 
PCT/SE00/01079 



Claims 10-11 relate to methods of treatment of the human 
animal body by surgery or by therapy/diagnostic methods 
practised on the human or animal body/ Rule. 39.1. (iv). 
Nevertheless, a search has been executed for these claims, 
search has been based on the alleged effects of the 
compounds/compositions . 



or 



The 
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Rec'd PCT/PTO 2 6 NOV 2001 




To 



THE SWEDISH PATENT OFFICE 
Stockholm 



International patent application no PCT/SEOO/01079 
Our ref: P04646PC00/HA:sd 



With reference to our telephone communication of September 13, 2001, we wish to file the 
enclosed page 16, comprising amended claim 14. 

The amendment consists of a specification of the purpose of the diagnostic method wherein 
the primers or probes of the invention are used, by adding at the end of the claim the passage 
"for the purpose of detecting Salmonella enterica subspecies I". A basis for this amendment 
may be found in the description, e.g. on page 7, lines 20-22 and lines 27-29. 

Stockholm, September 13, 2001 
Active Biotech AB et al 

by: 



Harriet Allee 



OR LUOWIG BR ANN PATENTBYRA AB. Intellectual Property & Law Firm. 

Stockholm Office: Box 171 92, S-104 62 Stockholm. Sweden. Visiting address: Maria Skolgata S3. Tel: +46 (0)8 429 10 00. 
(Non-Office hours: ^29 10 99) Fax: +4*5 (0)S 429 10 70 (Patents). Fax: +46 (0)8 429 10 90 (Trade Marks). 6-nwil: branny rann.se 
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9. A vector vaccine for the protection against diseases caused by Salmonella en- 
terica subspecies I serovar Typhi, comprising a host in which a recombinant vector 
comprising a nucleic acid sequence selected from Sequence Listing No. 2 (SEQ ID 
NO 2), has been inserted and, optionally, a pharmaceutically acceptable carrier. 

. 5" • 

10. A method for protection against diseases caused by Salmonella enterica sub- 
species I, comprising administering a vaccine according to any of claims 4, 6, and 
8. 

10 11. A method for protection against diseases caused by Salmonella enterica sub- 
species I serovar Typhi, comprising administering a vaccine according to any of cla- 
ims 5, 7, and 9. 

12. Antibodies directed against a peptide encoded by a nucleotide sequence se- 

15 lected from Sequence Listing No. 1 (SEQ ID NO 1) and Sequence Listing No. 2 (SEQ 
ID NO 2), for use in a diagnostic method. 

13. Peptide encoded by a nucleotide sequence selected from Sequence Listing No. 
1 (SEQ ID NO 1) and Sequence Listing No. 2 (SEQ ID NO 2) 7 for use in a diagnostic 

20 method. 

14. Primers for, or probes that hybridize with, a nucleotide sequence selected 
from Sequence Listing No, 1 (SEQ ID NO 1) and Sequence Listing No. 2 (SEQ ID NO 
2), for use in a diagnostic method for the purpose of detecting Salmonella enterica 

25 subspecies I. 
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09/926596 

Rec'd PCT/PTO 2 6 NOV 2001 



To - - 

THE SWEDISH PATENT OFFICE 
Stockholm 




<: 
cr 

CO 



International patent application no PCT/SEOO/01079 
Our ref: P04646PC00/HA:sd 



With reference to our telephone communication of September 2, 2001, we wish to file the 
following pages, comprising amendments as specified herein below: 



Amendments 



In the description: 

On page 4. under the heading "Sequence listing" , the words " Sequence Listing No.l. herein 
referred to as" have been inserted before "SEQ ID NO l'\ and the words "Sequence Listing 
No. 2, herein referred to as" have been inserted before "SEQ ED NO 2", 



In the claims: 

In new claims 1-4, 6, 8 and 12-14 the words "SEQ ID NO 1 " have been put in parenthesis and 
the words "Sequence Listing No. 1 " have been inserted before the parenthesis. 

In new claims 1-3, 5. 7. 9 and 12-14 the words "SEQ ID NO 2" have been put in parenthesis 
and the words "Sequence Listing No.2" have been inserted before the parenthesis. 



Furthermore: 



New claim 1 

line 1 : "protein" has been amended to "peptide"; 

lines 1 -2: "the group consisting of " has been deleted: 

line 2: "or parts thereof 1 has been deleted; 

New claim 2 

line 1 : "the protein" has been amended to "a peptide"; 

line 2: "the group consisting of " has been deleted; 

line 3: "or antigenic fragments thereof has been deleted; 



New claim 3 

line 1 : "the group consisting of " has been deleted; 

line 2: "or parts thereof has been deleted; 



OR LUDWfG flHANN PATE NTS YR A AB. Intellectual Property & Law Firm. 

Stockholm orrlce: Box 171 92, S-104 62 Stockholm. Sweden. Visiting address: Maria Skolgaia 83. Tel: +46 (0)6 4-29 10 00. 
(Non-Office hours: 4-29 10 99) Fax: +46 (0)8 429 10 70 (Patents). Faxi +46 (0)S 429 10 80 (Trade Marks). E-mail: brann@brann.se 
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\ % 

New claim 4 

line 2: "the protein" has been amended to "a peptide"; 

"or parts thereof 1 has been deleted; 

"the nucleotide sequence according to" has been amended to "a nucleotide 
sequence selected from"; 
line 4: "the protein encoded by SEQ ID NO 1" has been amended to "said peptide 1 '; 

"or antigenic fragments thereof 1 ' has been deleted; 

New claim 5 

line 2: "the protein, or parts thereof," has been amended to "a peptide"; 

line 3: "the nucleotide sequence according to" has been amended to "a nucleotide 

sequence selected from"; 
lines 3-4: "the protein encoded by SEQ ID NO 2" has been amended to "said peptide"; 

"or antigenic fragments thereof has been deleted; 

New claim 6 

line 2: "SEQ ID NO 1, or parts thereof 1 has been amended to "a nucleic acid sequence 

selected from Sequence Listing No. 1 (SEQ ID NO 1)"; 

New claim 7 

lines 2-3 : "SEQ ID NO 2, or parts thereof has been amended to "a nucleic acid sequence 
selected from Sequence Listing No. 2 (SEQ ID NO 2)"; 

New claim 8 

line 3: "SEQ ID NO 1, or parts thereof has been amended to "a nucleic acid sequence 

selected from Sequence Listing No. 1 (SEQ ID NO 1)"; 

New claim 9 

line 3: "SEQ ID NO 2, or parts thereof has been amended to n a nucleic acid sequence 

selected from Sequence Listing No. 2 (SEQ ID NO 2)"; 

New claim 12 

line 1 : "the protein" has been amended to "a peptide"; 

line 2: "the group consisting of T has been deleted; 

lines 2-3 : "or antigenic fragments thereof has been deleted; 

New cla im 1 3 

line 1 : "Protein" has been amended to "Peptide"; 

line 1-2: 4t the group consisting of has been deleted; 
line 2: "or parts thereof has been deleted; 

New claim 14 

line 2: "the group consisting of has been deleted. 



3 
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Basis for the amendments - 
The description 

The insertion on page 4 in the description is for a clarifying purpose only: it makes clear that 
by the wording "SEQ ID NO 1 " (and 2 respectively) is meant "Sequence Listing No. 1" (and 
2 respectively), conforming better to the established terminology within the field. 

The claims 

New claim 1 refers to the use of a peptide (instead of a protein) 

A basis for the use of a peptide may be found e.g. on page 6, first paragraph, where a method 
of obtaining a peptide according to the invention is outlined, and on page 6 7 third paragraph, 
where medical uses of such peptide are mentioned. 

Similar amendments (from protein to peptide) to claims 2-5 and 12 and 13 have the same 
basis in the description. 

All other amendments to the claims relate to linguistic improvements. 

Stockholm, September 4, 2001 
Active Biotech AB et al 



by: 



Harriet Allee 



of respective phage insert is represented by horizontal bars. Name and size of 
the phage inserts axe indicated on the left side of the figure. 
Figure 2: Schematic representation of the pTY52 cosmid comprising the tcf 
. operon (SEQ ID NO 2). 
5 A tcf specific PCR fragment of 1 11 05 bp was cloned into the Expand vector I 
cosmid (Roche). The insert is represented with a thick black line whole vector 
sequences are represented with thin lines. Relevant restriction sites sequences 
are indicated. The position of the tef-operon, i.e. tc/A, B, C and D (SEQ ID NO 
2), is represented by a shaded arrow. 
10 Figure 3: The phylogenetic distribution of the identified genes on the cs7 insert 
was investigated using the well defined SARC collection, see Example 1 . 
Figure 4: A 2 kb large internal EcoR I fragment was used as a probe in a 
Southern blot of the SARC collection, see Example 2. 

15 Sequence listing 

Sequence Listing No. 1, herein referred to as SEQ ID NO 1, — DNA sequence of 
the genes encoding the precursor of the saf fimbrie unit of Salmonella enterica 
subspecies I. 

Sequence Listing No. 2, herein referred to as SEQ ID NO 2, — DNA sequence of 
20 the genes which encode the precursor of the tcf fimbrie unit of Salmonella 
enterica subspecies I serovar Typhi. 

Deposit information 

The phages carrying the inserted SEQ ID NO 1, i.e. phages clones Bl, D1,F11 
25 and N10 (see Figure 1) have been given the ECACC Accession numbers 
99051922, 99051923, 99051924, and 99051925, respectively. 
The cosmide carrying the inserted SEQ ID NO 2, i.e. cosmide pTY52 (see Figure 
2) has been given the ECACC Accession number 99051926. 
The depositions were made May 19, 1999. 

30 

Detailed description of the invention 

The present invention is based on the finding that two fimbria! operons, the saf 
operori and the tcf operon, are specific for Salmonella enterica subspecies 1 
bacteria- Due to their specificity they can be used to provide vaccines against 
35 Salmonella enterica subspecies I as well as detection methods for Salmonella 
enterica subspecies I. The saf operon is specific for all Salmonella enterica 
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FIMBRIAL PROTEINS , 
Field ot the invention 

The present invention is based on the finding that two fimbrial operons, the saf 
operon and the tcf operon, are specific for Salmonella enterica subspecies 1 
bacteria and therefore have therapeutic use. Due to their specificity they can be 
5 used to provide vaccines against Salmonella enterica subspecies I as well as for 
detection of Salmonella enterica subspecies I. The saf operon is specific for all 
Salmonella enterica subspecies 1 bacteria and the tcf operon is specific for the 
serovar Typhi of Salmonella enterica subspecies 1. 

10 All or part of the DNA-sequences of the genes encoding these proteins can be 
used as active agents in a vaccine against diseases caused by the Salmonella 
enterica subspecies I bacterial strains or for detection of said bacterial strains. 

The present invention also relates to methods of isolating these fimbrial 
15 proteins, to antibodies directed against these proteins, and to a vaccine 

composition comprising these proteins or antibodies directed against these 
proteins for use in the treatment of infections caused by the Salmonella spp. 
The fimbrial proteins according to the invention or antibodies directed against 
them can be used for detection of Salmonella spp. bacteria. 

20 

Background of the invention 

The members of genus Salmonella spp colonize and infect a wide range of 
different organisms. Many cause gastroenteritis and enteric fever in humans 
and domesticated animals while others are not associated with human disease 

25 (Saylers et al, 1994). The genus has been divided into two species, Salmonella 
bongori and Salmonella enterica where enterica can be further subdivided into 
seven subspecies, designated I, II, Ilia, Illb, IV, VI, and VII (Reeves et al, 1989). 
Salmonella enterica subspecies I are preferentially associated with warm- 
blooded animals. Over 99% of all clinical Salmonella isolates are strains 

30 belonging to this subspecies, including serovars Typhimurium and Enteritidis, 
which are the major causes of Salmonella induced gastroenteritis in humans, 
and Typhi, the human specific causative organism of typhoid fever, the most 
severe form of human salmonellosis (Popoff et al, 1992). 
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Salmonella enterica subspecies I consists of over 1300 different serovars and is 
preferentially associated with warm-blooded animals (Baumler, 1997). Over 
99% of all clinical Salmonella isolates are strains belonging to this subspecies, 
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including serovars Typhimurium and Enteritidis, which are the major causes of 
Salmonella induced gastroenteritis in humans, and Typhi, the human specific 
causative organism of typhoid fever, the most severe form of human 
salmonellosis (Popoff and Le Minor, 1992). 

Today gastroenteritis and enteric fever can neither be prevented nor treated 
with good results. Typhoid fever is a substantial public health problem in 
developing countries. Each year 33 million people become ill and over 500 000 
people die from this infection (American Institute of Medicine, 1986). Typhoid 
fever can be prevented by vaccination with attenuated bacteria, such as Ty2 1 
and Vi vaccines and whole cell vaccines. Whole cell vaccines show a high 
incidence of side effects (Ashcroft et al, 1964, Yugoslav Typhoid commission, 
1964). The vaccines consisting of attenuated strains of Salmonella typhi suffer 
from serious drawbacks. They must be administered as three or four spaced 
doses in order to stimulate protective immune responses (Levine et al, 1989). 
The treatment of Salmonella typhi w T ith antibiotics is jeopardized since there 
are strains of Salmonella typhi that are resistant to chloramphenicol, 
ampicillin, and trimethoprim as well as ciprofloxacin (i.e. multidrug-resistant 
strains) (Rowe et al, 1997). 

Accurate detection of Salmonella enterica subspecies I is today not possible. 
Salmonella enterica subspecies I can today only be detected by antibodies 
directed against surface proteins of Salmonella enterica subspecies I. The use of 
the sequences according to the invention makes it for the first time possible to 
rapidly and accurately determine the presence of Salmonella enterica 
subspecies I. 

For many pathogenic bacteria, there is evidence that the filamentous surface 
protein structures called pili (fimbriae) are connected to the adhesion of the 
bacteria to the host cells. Pili proteins are very antigenic and are easily 
purified. Therefore pili preparations have been used as antigens for 
vaccination. 

Summary of the invention 

The invention relates to the objects as defined in the claims. The main object of 
the present invention is to provide two fimbrial proteins that are specific for 
Salmonella enterica subspecies I bacterial strains, the nucleotide sequences 
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encoding said proteins, as well as the corresponding amino acid sequences of 
for therapeutic and diagnostic use. Further are recombinant microorganisms 
provided, in which the nucleotide sequences according to the invention have 
been inserted. 

5 

An object of the present invention is to provide vaccine compositions for use in 
the treatment of Salmonella enterica infective strains, essentially pure Saf and 
Tcf fili protein of Salmonella enterica subspecies I and Salmonella enterica 
subspecies I serovar Typhi, respectively, as well as antibodies directed to these 
10 fili proteins. 

A further object of the present invention is to provide the DNA sequences of the 
genes encoding the Saf and Tcf proteins. These sequences can be used for 
recombinant production of the proteins and for the preparations of vector 
15 vaccines against Salmonella enterica subspecies 1 and Salmonella enterica 
subspecies 1 serovar Typhi, respectively, as well as for diagnostic purposes. 

Yet another object of the present invention to use purified Saf and Tcf protein 
from Salmonella enterica subspecies 1 bacteria for active or passive 
20 immunization of mammals, i.e. the proteins according to the invention can be 

comprised in a vaccine composition or be used to raise antibodies which can be 
comprised in a vaccine composition. 

Finally, an object of the present invention is to provide a method for preventing 
25 or reducing the possibility of Salmonella infection of a mammal by 

administering the vaccines according to the invention. The invention may be 
more fully understood by reference to the following drawings and detailed 
description. 

30 Brief description of the drawings 
Figure 1. 

Schematic representation of phage clones (named N10, Dl, Bl, Fll) covering 
cfc^ the entire cs7 insert of Salmonella enterica serovar Typhimurium strain 

SR^f 3181, i.e. comprising the sa/fimbrial operon, i.e. safA, B, C and D 
35 (SEQ ID NO 1). 

The clones were selected from partial Eco RI and BamHl libraries in the 
Lambda Dash II vector. The cs7 insert is represented by a bold line. The extent 
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of respective phage insert is represented by horizontal bars. Name and size of 
the phage inserts are indicated on the left side of the figure. 
Figure 2: Scft^matic representation of the pTY52 cosmid comprising the tcf- 
operon (SEQ IDSNO 2). 
5 A tcf specific PCR\fragment of 1 1 105 bp was cloned into the Expand vector I 
cosmid (Roche). Thfe insert is represented with a thick black line while vector 
sequences are represented with thin lines. Relevant restriction sites sequences 
are indicated. The position of the tc/-operon, i.e. tcf A, B } C and D (SEQ ID NO 
2), is represented by a shaded arrow. 
10 Figure 3: The phylogenetic distribution of the identified genes on the cs7 insert 
was investigated using the well defined SARC collection, see Example 1. 
Figure 4: A 2 kb large internal £coR I fragment was used as a probe in a 
Southern blot of the SARC collection, see Example 2. 

15 Sequence listing 

— n Sequence Lifting No. 1, herein referred to as SEQ ID NO 1, — DNA sequence of 
J the genes encoding the precursor of the saf fimbrie unit of Salmonella enterica 
/ subspecies I. \ 

Sequence Listing No. 2, herein referred to as SEQ ID NO 2, — DNA sequence of 
20 the genes which Encode the precursor of the tcf fimbrie unit of Salmonella 
enterica subspecie)p I serovar Typhi. 

Deposit information 

The phages carrying the inserted SEQ ID NO 1, i.e. phages clones Bl, Dl, Fll 
and N10 (see Figure 1) have been given the ECACC Accession numbers 
99051922, 99051923, 99051924, and 99051925, respectively. 
The cosmide earning the inserted SEQ ID NO 2, i.e. cosmide pTY52 (see Figure 
2) has been given the ECACC Accession number 99051926. 
The depositions were made May 19, 1999. 
30 

Detailed description of the invention 

The present invention is based on the finding that two fimbrial operons, the saf 
operon and the tc/operon, are specific for Salmonella enterica subspecies 1 
bacteria. Due to their specificity they can be used to provide vaccines against 
35 Salmonella enterica subspecies I as well as detection methods for Salmonella 
enterica subspecies I. The saf operon is specific for all Salmonella enterica 
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subspecies 1 bacteria and the tcf operon is specific for the serovar typhi of 
Salmonella enterica subspecies 1, see Examples 1 & 2. 

The main object of the invention relates to two fimbria! operons, the saf operon 
and the tc/operon, that are specific for Salmonella enterica subspecies 1 
bacteria for terapeutic use. 

Another object of the present invention is to provide vaccines against 
Salmonella enterica subspecies 1 induced gastroentritis, entric fever and 
typhoid fever. 

A further object of the present invention is to provide methods to detect 
Salmonella enterica subspecies 1. The nucleotide sequences according to the 
invention are useful for constructing vectors for use as vaccines for insertion 
into attenuated bacteria in constructing a recombinant vaccine, for insertion 
into a viral vector in constructing a recombinant viral vaccine, or for direct 
inoculation as a nucleic acid vaccine. The pili proteins according to the 
invention, or antigenic fragments thereof, can be used for active immunization 
and antibodies directed against them can be used for passive immunization. All 
these applications of the sequences according to the invention are obtained by 
applying standard techniques known to the man ordinary skilled in the art. 

Vaccines against Salmonella enterica subspecies I. 

The genes encoding the saf and tcf fimbrial structures, or fragments thereof, 
may be incorporated into a bacterial or viral vaccine comprising recombinant 
bacteria, virus or fungi which are engineered to produce one or more 
immunogenic epitopes of the saf or tcf fimbrial structures. In addition, the 
genes encoding the saf and tcf fimbrial structures, or part thereof, operatively 
linked to regulatory elements, can be introduced directly as a nucleic acid 
vaccine, to elicit a protective immune response. 

The proteins or antigenic fragment thereof, deduced from the nucleic acid 
sequences of the present invention are useful alone or in conventional vaccine 
mixtures in the vaccine compositions according to the invention. The proteins 
could be produced by chemical synthesis or recombinant expression according 
to conventional methods. 
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The proteins and peptides according to the invention can be obtained by using 
a host organism transformed or transfected with an expression vector obtained 
by insertion of a gene according to the invention, or part thereof, into a vector 
in a conventional manner. The vector which is used to construct the expression 
vector is not particularly limited, but specific examples include plasmids such 
as pET (Stratagen) and the like; and phages such as M13 (NEB), phage display 
libraries and the like. As expression regulatory sequence can among others T7 
promotors and lac promotors be used. 

An appropriate host to be transformed or transfected with the expression vector 
can be chosen among for example E.-coli, Salmonella or Bacillus subtilus. The 
transformed or transfected host is cultured and proliferated under suitable 
conditions. 

After culturing, the peptides of the present invention may be purified by, for 
example, chromatography, precipitation, and/or density gradient 
centrifugation. The thus obtained peptides can be used as a vaccine or for the 
production of antibodies directed against said peptides, which can be used for 
passive immunization. 

The purified preparation containing one or several proteins according to the 
invention, or parts thereof, is then formulated as a pharmaceutical 
composition, as for example a vaccine, or in a mixture with adjuvants. If 
desired the proteins are fragmented by standard chemical or enzymatic 
techniques to produce antigenic segments. 

In formulating the vaccine compositions with the peptide or protein, alone or in 
various combinations, the immunogen is adjusted to an appropriate 
concentration and formulated with any suitable vaccine adjuvant. The 
immunogen may also be incorporated into liposomes, or conjugated to 
polysaccharides and/ or other polymers for use in a vaccine formulation. 

The different vaccines according to the present invention are administered to 
mammals in many different ways. These include intradermal, intramuscular, 
intraperitoneal, intravenous, subcutaneous, oral, and intranasal routes of 
administration. The vaccine doses will differ depending on circumstances such 
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as body weight, interferences with other administered medicaments etc. The 
upper limit is not critical unless the dose shows toxicity. 

The peptides and proteins of the present invention are also useful to produce 
5 monoclonal or polyclonal antibodies for use in passive immunotherapy against 
Salmonella enterica subspecies 1. Human immunoglobulin is preferred. 
Antisera is obtained from individuals immunized with proteins or peptides 
according to the invention. The immunoglobulin fraction is then enriched, for 
example by immunoaffininty or affininty chromatography. Antibodies raised in 
10 a suitable mammal or in the patient to be treated, can subsequently be 
administered locally or topically, e.g. orally to the patient. 

Detection of Salmonella enterica subspecies I in general. 
The sequences according to the invention, or part thereof, or fragments 
15 hybridizing therewith, as well as the proteins according to the invention, or part 
thereof, and antibodies directed to said proteins, or antigenic fragments 
thereof, can be used in molecular diagnostic assays for the detection of 
Salmonell enterica subspecies L 

20 Nucleic acids having the nucleotide sequence according to the invention, or any 
nucleotide sequence hybridizing therewith can be used as a probe in nucleic 
acid hybridization assays for the detection of Salmonella spp in various tissues 
and body fluids of patients. The hybridization assay may be of any type 
including; Southern blots, Northern blots, colony blots. 

25 

PCR technology is the most preferred technology for detection according to the 
invention of Salmonella enterica subspecies 1 . Primers of at least one selected 
from the 5' end and one from the 3' end can be used in PCR and other known 
tests to rapidly identify the presence of Salmonella enterica subspecies 1. This is 
30 according to conventional techniques. 

The isolated and purified proteins and peptides of the invention can be used as 
diagnostics to measure an increase in serum titer of Salmonella enterica 
subspecies I-specific antibody since they bind strongly to these antibodies. A 
35 serum test sample can be screened for Salmonella enterica subspecies I by 
methods such as for example ELISA. 
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The invention further comprises the use of antibodies directed against the saf 
and tcf fimbrie structures for quantitative or qualitative determinations of the 
pili proteins of the invention, or fractions thereof, in cells, tissues or body 
fluids. 

Detection of Salmonella enterica subspecies I by using nucleic acid 
hybridization technology 

Nucleic acid hybridization technology can also be to detect Salmonella enterica 
subspecies 1 according to the invention. The nucleic acid probes chosen from 
parts of the sequences according to the invention can be either DNA or RNA. 
DNA sequences complementary to the sequences according to the invention 
can also be used. The binding of the probe to the target sequence, i.e. the 
hybridization, must not be perfect. Variations and mutations of the sequences 
according to the invention can be used as long as they hybridize good enough 
to detect Salmonella enterica subspecies I. The preferred length of the nucleic 
acid probes is about 10 to 400 nucleotides, most preferred not longer than 100 
nucleotides. 

The nucleotide probe is preferably chosen from the parts of the sequences that 
have the least variation. In the most preferred embodiments when screening for 
SEQ ID NO 1 (the saf operon, specific for Salmonella enterica subspecies 1) a 
nucleotide probe or PCR primer selected from nucleotides 37 368-37 868 
should be avoided since this region is hypervariable. 

The nucleic acid probes according to the invention are prepared by any 
conventional method such as organic synthesis, recombinant techniques, or 
isolation from genomic DNA. 

The nucleic acid probes of the invention are labeled in a conventional manner 
to signal hybridization to target nucleic acid from Salmonella enterica 
subspecies I. The labeling may comprise a radiolabel, an enzyme, a bacterial 
label, a fluorescent label, an antibody, an antigen, a latex particle, an electron 
dense compound, or a light scattering particle. 

The probes may be provided in a lyophilized form, to be reconstituted in a 
buffer appropriate for hybridization, or the probes may already be present in 
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such a buffer. The buffer may contain a suitable hybridization enhancer, 
detergent, carrier DNA, and a compound to increase the specificity. 

Any conventional hybridization assay technique, such as dot blot hybridization, 
5 Southern blotting, sandwich hybridization, displacement hybridization and the 
like, can be used. 

The target analyte polynucleotide of a microorganism may be in various media, 
most often in a biological, or physiological specimen. In most cases it is 
10 preferred to subject the specimen containing the target polynucleotide to any 
conventional extraction, purification, and/ or isolation before conducting the 
analysis. 

The sample containing the target analyte nucleotide sequence must often be 
15 treated to convert the DNA to a single-stranded form, which may be 

accomplished by a variety of conventional techniques, such as thermal or 
chemical techniques. 

The following examples describe the isolation and specificity of the sequences 
20 according to the invention. 

EXAMPLE 1 

Identification and characterization of the saf operon. 

The present inventors found, upon investigation of a 7 kb chromosomal region 
25 on centisome 7 originally isolated from the S. typhimurium strain SR-1 1 k3181, 
a region that exhibits many of the traits that define a pathogenicity island. It 
has a lower G+C composition than the average composition of the Salmonella 
genome and includes many sequences related to different mobile genetic 
elements. The region is not present in E.coli K12, and the Salmonella specific 
30 DNA is inserted between the tRNA gene asp V and the stop codon of yafV, a 
hypothetical protein upstream of the yafH gene at 5 min in the E.coli 
chromosome. This Salmonella specific insert encodes proteins creating adhesive 
structures and other virulence factors. Sequencing revealed genes encoding a 
new fimbrial operon that they designated Salmonella Atypical Fimbriae (sq/J, 
35 due to its relatedness to a subgroup of adhesive structures forming thin 
atypical fimbriae or non-fimbrial adhesins.^ 
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The sa/operon consists of four contiguous genes, safA, safB, safC and safD 
that encode fimbrial subunit, periplasmic chaperone, outer membrane usher 
protein and alternative fimbrial subunit, respectively. The genes safA, B, C and 
sq/D encode putative proteins of 166, 244, 836 and 156 amino acids, 
respectively. Analyzes of clinical Salmonella isolates showed that DNA of 195 
out of 198 clinical isolates belonging to S. enterica subspecies I hybridized with 
safB and safC, i.e. these sequences are common to more than 99% of the 
known Salmonella enterica subspecies 1 bacteria. The inventors showed that 
58% of these clinical isolates cany the safA, see Table 1. 
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Table 1. The prevalence of the saf genes in clinical Salmonella isolates. 



Serovar 


safA 


sajrs 


SG/L 


# isolates 


S. adelaide 








1 


S. agona 




+ 


+ 


6 


S. anatum 


- 


+ 


+ 


3 


S. bareilly 


+ 






3 


S. blockley 


+ 




-r 


3 


S. bovismorbificans 


- 


+ 




5 


S. braenderup 


- 


+ 


+ 


4 


S. brandenburg 


+ 


+ 


+ 


1 


S. bredeney 


+ /- 


+ 


+ 


15 


S. Chester 


+ 




+ 


1 


S. colindale 


- 


+ 


+ 


1 


S. derby 


- 


+ 




1 


S. dubhn 


— 


+ 


+ 


1 


S. eastboume 


+ 




+ 


2 


S. emek 




+ 


+ 


1 


£>. enrennais 




+ 


+ 


o 

O 








+ 


1 


S. goettingen 




+ 


+ 


1 


S. haardt 








1 


S. hadar 


+ 


+ 




16 


S. Heidelberg 


- 


+ 


+ 


1 


S. huittingfoss 






+ 


5 


S. infantis 


-/ + 




+ 


6 


S. Java 


• 






1 


S. jauiana 


— 


+ 


+ 


1 


S. kottbus 


- 


+ 


+ 


1 


S. livingstone 








1 


S. london 


+ 






1 


o. maastncnl 


+ 


+ 


+ 




S. mbandaka 


- 


- 


- 


3 


S. monteuideo 








1 


S. muenster 




+ 


+ 


1 


S. netuport 










S. ohio 


+ 


+ 




1 


S. oraruenburg 


+ 


+ 


+ 


2 


S. panama 


+ 




+ 


3 


S. potsdam 


+ 




+ 


1 


S. rissen 








1 


S. saarbrucken 




+ 




1 


S. saint paul 


+ 




+ 


3 


S. schujartzengrund 




+ 


+ 


1 


S. Singapore 


+ 






1 


S. Stanley 


+ 




+ 


D 


S. subsp 1 4.5, 12:-:- 


+ 


+ 


+ 


2 


S. subsp 1 4.5, 12:b:- 




+ 


+ 


1 


£>. suosp / 4.5, 12:i:- 


+ 






1 


o. stiDsp 1 spont 




+ 


i 


i 


S. tennessee 


+ 




+ 


2 


S. thompson 






+ 


1 


S. typhi 






+ 


1 


S. typhimurium 


+ 


+ 




27 


S. virchouj 






+ 


7 


S. welterureden 




+ 




1 


S. wortrungton 








2 


S. subsp m 








1 
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The phylogenetic distribution of the identified genes on the cs7 insert was 
investigated using the well defined SARC collection, which showed that the 
presence of the safA, safB, safC and safD genes is restricted to S. enterica 
5 subspecies I (Fig. 3). This region is hence the first subspecies I specific genetic 
region to be identified with a broad distribution within the subspecies. Since 
the serovars of subspecies I constitute over 99% of human salmonellosis and 
are preferentially associated with warm blooded animals, it implicates a role for 
the saf adhesive organelle in the colonization of these organisms. 

10 

EXAMPLE 2 

Identification and characterization of the tcf operon. 

The present inventors found thai Salmonella enterica subspecies I serovar Typhi 
contains DNA encoding an additional fimbria! operon, the tcf operon, in the 

15 sinR-pagN intergenic region. Southern blot analysis revealed a markedly 
different restriction pattern in S. enterica serovar Typhi than the other 
subspecies I isolates, suggesting that the saf-sin region in serovar Typhi might 
carry additional DNA relative to serovar Typhimurium strains. A PGR reaction 
(using a kit from Roche) was therefore performed using a sinR (5'-GTA AAT 

20 CGC TTA GTC GCC-3') specific forward primer and a pagN (5'-TCA ACT CAA 
CCT TCA GCC-3') specific reverse primer. 

This primer pair produced, as expected, a product of 2 kb in serovar 
Typhimurium from the SARC collection, while from serovar Typhi the product 
25 was 10 kb. Thus, the neighboring sinR and pagN genes in serovar 

Typhimurium strains are separated by approximately 8 kb in serovar Typhi. 

The Typhi specific PCR product was purified, digested partially with EcoRI and 
sub-cloned into pUC18 forming a set of overlapping clones. Sequencing of the 

30 clones revealed a putative fimbrial operon designated tcf 'for Typhi Colonizing 
Factor. Four ORFs, tcfA,B,C,D, have been identified with putative proteins 
having significant homology to CooB (38% identical over 192 aa), CooA (37% 
identical over 170 aa), CooC (34% identical over 872 aa) and CooD (31% 
identical over 272 aa), respectively. The Coo proteins are involved in the 

35 biosynthesis of the CS1 colonizing factor antigens of enterotoxigenic E.coli (Fig. 
4) (Froehlich et al., 1994). The peptide of the tcfB ORF is also homologous to 
the CblA major fimbrial subunit protein (45% identical over 154 aa) of the cable 



WO 00/73336 



13 



PCT/SE00/01079 



type II pili of the cystic fibrosis-associated Burkholderia cepacia(Sajjan et al., 
1995). Down-stream of the tc/-operon two ORFs were identified with the same 
transcriptional orientation as the tcf genes. The first was designated tinR for 
Typhi insert regulator because it is homologous (33% identical over 144 aa) to 
AzlB of Bacillus subtilis, a member of the Lrp/AsnC family of transcriptional 
regulators (Belitslcy et al., 1997). tinR is followed by an ORF (tioA for Typhi 
insert orf) encoding a putative protein of 205 amino acids with no significant 
homologies to anything in the DDBJ/EMBL/GenBank databases. The above 
sequence from Salmonella enterica serovar Typhi strain RKS 3333 and the tcf 
region of the incomplete genome sequence from serovar Typhi strain CT18 ( 
http:// www.sanger.ac.uk) are 99% identical over the total length of the 
investigated region in concordance with the clonal nature of the serovar . 

A 2 kb large internal EcoR I fragment was used as a probe in a Southern blot of 
the SARC collection. This blot shows that Salmonella enterica subspecies I 
serovar Typhi (SARC2) is the only strain in the collection possessing DNA 
hybridizing to this fragment (Fig. 4). 
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Claims: 



1. \ Peptide encoded by a nucleotide sequence selected from Sequence Listing No. 
1 (SEQ ID NO 1) and Sequence Listing No. 2 (SEQ ID NO 2) for use in medicine. 

2. Antibodies directed against a peptide encoded by a nucleotide sequence se- 
lected froiri\Sequence Listing No. 1 (SEQ ID NO 1) and Sequence Listing No. 2 (SEQ 
ID NO 2) for\se in medicine. 

10 3. Nucleotide Sequence selected from Sequence Listing No. 1 (SEQ ID NO 1) and 
Sequence Listing ^p. 2 (SEQ ID NO 2) for use in medicine. 

4. A vaccine for theV>rotection against diseases caused by Salmonella enterica 
subspecies I, comprising* a peptide encoded by a nucleotide sequence selected from 

15 Sequence Listing No. 1 (SEQ ID NO 1), or antibodies directed against said peptide 
and, optionally, a pharmaceutically acceptable carrier. 

5. A vaccine for the protection against diseases caused by Salmonella enterica 
subspecies I serovar Typhi, composing a peptide encoded by a nucleotide sequence 

20 selected from Sequence Listing No. \2 (SEQ ID NO 2), or antibodies directed against 
said peptide and, optionally, a pharmaceutically acceptable carrier. 

6. A nucleic acid vaccine for the protection against diseases caused by Salmo- 
nella enterica subspecies I, comprising a nucleic acid sequence selected from Se- 

25 quence Listing No. 1 (SEQ ID NO 1) and, optionally, a pharmaceutically acceptable 
carrier. \ 

7. A nucleic acid vaccine for the protection against diseases caused by Salmo- 
nella enterica subspecies I serovar Typhi, comprisingSa nucleic acid sequence se- 

30 lected from Sequence Listing No. 2 (SEQ ID NO 2) andV^ptionally, a pharmaceuti- 
cally acceptable carrier. \ 

8. A vector vaccine for the protection against diseases caused by Salmonella en- 
terica subspecies I, comprising a host in which a recombinairt vector comprising a 

35 nucleic acid sequence selected from Sequence Listing No. 1 (SEJQ ID NO 1), has 
been inserted and, optionally, a pharmaceutically acceptable carrier. 
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9. A vector vaccine f^the protection against diseases caused by Salmonella en- 
terica subspecies I serovar^Vphi, comprising a host in which a recombinant vector 
comprising a nucleic acid sequence selected from Sequence Listing No. 2 (SEQ ID 
NO 2), has been inserted and, optionally, a pharmaceutical^ acceptable carrier. 





10. A method for protection a 
species I, comprising 
8. 



administe ring 



;ainst diseases caused by Salmonella enterica sub- 
a vaccine according to any of claims 4,6, and 



20 



11. A method for protection aj ;ainst diseases caused by Salmonella enterica sub- 
species I serovar Typhi, comprising administering a vaccine according to any of cla- 
ims 5, 7, and 9. 

12. Antibodies directed against a peptide encoded by a nucleotide sequence se- 
lected from Sequenck Listing No. 1 (SEQ ID NO 1) and Sequence Listing No. 2 (SEQ 
ID NO 2), for use in a ajagnostic method. 

13. Peptide encoded by ^nucleotide sequence selected from Sequence Listing No. 
1 (SEQ ID NO 1) and Sequerf^e Listing No. 2 (SEQ ID NO 2), for use in a diagnostic 
method. 



25 



14. Primers for, or probes that hybridize with, a nucleotide sequence selected 
from Sequence Listing No. 1 (SEQ ID NO 1) and Sequence Listing No. 2 (SEQ ID NO 
2), for use in a diagnostic method for\he purpose of detecting Salmonella enterica 
subspecies I. 
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Claims: 



1 . Peptide encoded by a nucleotide sequence selected from Sequence Listing No. 
1 (SEQ ID NO 1) and Sequence Listing No. 2 (SEQ ID NO 2) for use in medicine. 

5 

2. Antibodies directed against a peptide encoded by a nucleotide sequence se- 
lected from Sequence listing No. 1 (SEQ ID NO 1) and Sequence Listing No. 2 (SEQ 
ID NO 2} for use in medicine. 

10 3. Nucleotide sequence selected from Sequence Listing No. 1 (SEQ ID NO 1) and 
Sequence listing No. 2 (SEQ ID NO 2) for use in medicine. 

4. A vaccine for the protection against diseases caused by Salmonella enterica 
subspecies I, comprising a peptide encoded by a nucleotide sequence selected from 

15 Sequence Listing No. 1 (SEQ ID NO 1), or antibodies directed against said peptide 
and, optionally, a pbarmaceutically acceptable carrier. 

5. A vaccine for the protection against diseases caused by Salmonella enterica 
subspecies I serovar Typhi, comprising a peptide encoded by a nucleotide sequence 

20 selected from Sequence Listing No. 2 (SEQ ID NO 2), or antibodies directed against 
said peptide and, optionally, a phaxmaceuticaHy acceptable carrier - 

6. A nucleic acid vaccine for the protection against diseases caused by Salmo- 
nella enterica subspecies I, comprising a nucleic acid sequence selected from Se- 

25 quence Listing No. 1 (SEQ ID NO 1) and, optionally, a pharmaceutically acceptable 
carrier, 

7. A nucleic acid vaccine for the protection against diseases caused by Salmo- 
nella enterica subspecies I serovar Typhi, comprising a nucleic add sequence se- 

30 lected from Sequence Listing No. 2 (SEQ ID NO 2) and, optionally, a phanmaceuti- 
cally acceptable carrier. 

S. A vector vaccine for the protection against diseases caused by Salmonella en- 
terica subspecies I, comprising a host in which a recombinant vector comprising a 
35 nucleic acid sequence selected from Sequence Listing No. 1 (SEQ ID NO 1), has 
been inserted and, optionally, a pharmaceutically acceptable carrier. 




9- A vector vaccine for the protection against diseases caused by Salmonella en- 
terica subspecies I serovar Typhi, comprising a host in which a recombinant vector 
comprising a nucleic add sequence selected from Sequence Listing No. 2 (SEQ ID 
NO 2), has been inserted and, optionally, a pharmaceutically acceptable carrier. 

5 

10. A method for protection against diseases caused by Salmonella enterica sub- 
species I, comprising administering a vaccine according to any of claims 4, 6, and 
8. 

10 11. A method for protection against diseases caused by Salmonella enterica sub- 
species I serovar Typhi, comprising administering a vaccine according to any of 
claims 5, 7, and 9. 

12. Antibodies directed against a peptide encoded by a nucleotide sequence se- > 
15 lected from Sequence listing No. 1 (SEQ ID NO 1) and Sequence Listing No. 2 (SEQ 

ID NO 2), for use in a diagnostic method. 

13. Peptide encoded by a nucleotide sequence selected from Sequence Listing No. 
1 (SEQ ID NO 1) and Sequence Listing No. 2 (SEQ ID NO 2), for use in a diagnostic 

20 method, 

14. Primers for, or probes that hybridize with, a nucleotide sequence selected 
from Sequence Listing No. 1 (SEQ ID NO 1) and Sequence Listing No. 2 (SEQ ID .NO 
2), for use in a diagnostic method. 
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of respective jShage insert is represented by horizontal bars. Name and size of 
the phage inserts are indicated on the left side of the figure. 
Figure 2. 

Schematic representation of the pTY52 cosmid comprising the tc/"-operon (SEQ 
ID NO 2). 

A tcf specific PCR fragment of 11105 bp was cloned into the Expand vector I 
cosmid (Roche). The insert is represented with a thick black line while vector 
sequences are represented with thin lines. Relevant restriction sites sequences 
are indicated. The position of the tc/"-operon, i.e. tcfA, B } C and D (SEQ ID NO 
2), is represented by a shaded arrow. 
Figure 3. 

The phylogenetic distribution of the identified genes on the cs7 insert was 
investigated using the well defined SARC collection, see Example 1. 
Figure 4. 

A 2 kb large internal EcoR I fragment was used as a probe in a Southern blot of 
the SARC collection, see Example 2. 

Sequence listing 

SEQ ID NO 1 — DNA sequence of the genes encoding the precursor of the saf 
fimbrie unit of Salmonella enterica subspecies I. 

SEQ ID NO 2 — DNA sequence of the genes which encode the precursor of the 
tcf fimbrie unit of Salmonella enterica subspecies I serovar Typhi. 

Deposit information 

The phages carrying the inserted SEQ ID NO 1, i.e. phages clones Bl, Dl, Fl 1 

and N10 (see Figure 1) have been given the ECACC Accession numbers 

99051922, 99051923, 99051924, and 99051925, respectively. 

The cosmide carrying the inserted SEQ ID NO 2, i.e. cosmide pTY52 (see Figure 

2) has been given the ECACC Accession number 99051926. 

The depositions were made May 19, 1999. 

Detailed description of the invention 

The present invention is based on the finding that two fimbrial operons, the saf 
operon and the tc/ operon, are specific for Salmonella enterica subspecies 1 
bacteria. Due to their specificity they can be used to provide vaccines against 
Salmonella enterica subspecies I as well as detection methods for Salmonella 
enterica subspecies I. The saf operon is specific for all Salmonella enterica 
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Claims: 

1. Protein encoded by a nucleotide sequence selected from the group 
consisting of SEQ ID NO 1 and SEQ ID NO 2, or parts thereof, for use in 

5 medicine. 

2. Antibodies directed against the protein encoded by a nucleotide sequence 
selected from the group consisting of SEQ ID NO 1 and SEQ ID NO 2, or 
antigenic fragments thereof for use in medicine. 

10 

3. Nucleotide sequence selected from the group consisting of SEQ ID NO 1 
and SEQ ID NO 2, or parts thereof, for use in medicine. 

4. A vaccine for the protection against diseases caused by Salmonella enterica 
15 subspecies I, comprising the protein, or parts thereof, encoded by the 

nucleotide sequence according to SEQ ID NO 1 or antibodies directed against 
the protein encoded by SEQ ID NO 1, or antigenic fragments thereof and, 
optionally, a pharmaceutically acceptable carrier. 

20 5. A vaccine for the protection against diseases caused by Salmonella enterica 
subspecies I serovar Typhi, comprising the protein, or parts thereof, encoded 
by the nucleotide sequence according to SEQ ID NO 2 or antibodies directed 
against the protein encoded by SEQ ID NO 2, or antigenic fragments thereof 
and, optionally, a pharmaceutically acceptable carrier. 

25 

6. A nucleic acid vaccine for the protection against diseases caused by 
Salmonella enterica subspecies I, comprising SEQ ID NO 1 , or parts thereof 
and, optionally, a pharmaceutically acceptable carrier. 

30 7. A nucleic acid vaccine for the protection against diseases caused by 

Salmonella enterica subspecies I serovar Typhi, comprising SEQ ID NO 2, or 
parts thereof and, optionally, a pharmaceutically acceptable carrier. 

8. A vector vaccine for the protection against diseases caused by Salmonella 
35 enterica subspecies I, comprising a host in which a recombinant vector 

comprising SEQ ID NO 1, or parts thereof, has been inserted and, optionally, a 
pharmaceutically acceptable carrier. 
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9. A vector vaccine for the protection against diseases caused by Salmonella 
enterica subspecies I serovar Typhi, comprising a host in which a recombinant 
vector comprising SEQ ID NO 2, or parts thereof, has been inserted and, 
5 optionally, a pharmaceutically acceptable carrier. 

10* A method for protection against diseases caused by Salmonella enterica 
subspecies I, comprising administering a vaccine according to any of claims 4, 
6, and 8. 

10 

11, A method for protection against diseases caused by Salmonella enterica 
subspecies I serovar Typhi, comprising administering a vaccine according to 
any of claims 5, 7, and 9. 

15 12. Antibodies directed against the protein encoded by a nucleotide sequence 
selected from the group consisting of SEQ ID NO 1 and SEQ ID NO 2, or 
antigenic fragments thereof, for use in a diagnostic method. 

13. Protein encoded by a nucleotide sequence selected from the group 

20 consisting of SEQ ID NO 1 and SEQ ID NO 2, or parts thereof, for use in a 
diagnostic method. 

14. Primers for or, probes that hybridize with a nucleotide sequence selected 
from the group consisting of SEQ ID NO 1 and SEQ ID NO 2, for use in a 

25 diagnostic method. 
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1 

SEQUENCE LISTING NO. 1 

<110> Folkesson, Anders 

<120> The complete sequence of the cs7 insert in Salmonella 
enteric serovar Typhimurium 

<130> Complete sequence of the cs7 insert 

<140> 
<141> 

<160> 5 

<170> Patentln Ver. 2.1 

<210> 1 

<211> 46870 

<212> DNA 

<213> Salmonella typhimurium 

<220> 

<221> CDS 

<222> (37368) . . (37368 ) 

<223> safA putative fimbrial subunit 

<220> 
<221> CDS 

<222> (37952) . . (38689) 

<223> safB putative periplasmic chaperone 

<220> 
<221> CDS 

<222> (38713) . . (41223) 

<223> safe putative outer membrane usher 

<220> 
<221> CDS 

<222> (41245) . . (41715) 

<223> safD putative fimbria! subunit 
<400> 1 



gatacaaatc 


tcagggtgtt 


tttatacatc 


ctgtgaagta 


aaaaaaaccg 


tatcactgta 


60 


aaagggatac 


ggtttttttt 


cgtcttcaag 


aagttccacc 


gtctatcgtg 


gaatctggcg 


120 


caaatgggee 


tacgcctgga 


tgacgaacag 


gatattaccg 


ccacttcttt 


cactgtcatg 


180 


gctattttga 


tcccactgac 


atttaaggcg 


cggcctcatg 


gcggtgctta 


acegggateg 


240 


ggacatgttc 


agegcagaag 


cagactgegt 


aatgttgata 


tcactcagat 


aattaeggag 


300 


aaccgccaga 


catgcgcatc 


atcactccag 


ggcatcccac 


ttctccagca 


actccaccgg 


360 


gatctcattg 


atcacctccg 


agaaccgttt 


tcccaccagt 


ctttcagcct 


ggcgtaacag 


420 


tgggatggtc 


gggctactgg 


gttcactget 


ctcaaaccag 


egacgaatge 


accgcaggcg 


480 


ttccagcgca 


tcattacgat 


cgcgaattgg 


ccccggttca 


gcattgtgtg 


ggagggatat 


540 


accgggtggc 


gcgttgacag 


gcggcacatc 


ctctgcccgt 


geeggaaegg 


cattatccat 


600 
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aaccgcgcct gcggggtcng gcactggcgc tggcgacgga gggatttccg gcgcggttgt 660 
ctgtaccacg tcaggcagca gcgctaacaa ctgccgcagg cgtgaaaaat ctggagccag 72 0 
atcgcctaat gtctcacgcg cccacacttg caggcgttct gcgccttcct gcgcttcgcg 780 
aaacgccgcc agcggcaatg ccccacgggc ttcgaggtcg gccagctgcn ggcggacgga 84 0 
t-cgggggcc agcgcaccag ccgggcgggg agcggacaac gcgcgcncca catcccgcac 900 
ccgcaggcgc agggcggcac tgtttgacag cgtaataccg cgaatgtccg ccatcacgcc 960 
ctcgcgatcc agcagcgccg ccagggcatt actgcgcgcc agcgacgcat cttcaacaga 1020 
ttctgcggat gcgccttcgc cggtaagcag ctgtggatgg agtgcatcag accagattac 1080 
gctcagtcct gccagtcgtg tgagcatttc cgccagtccc tgtgcgccag cctgctggat 1140 
gcgaccgcgc agcagcagga tcaacacccg gacatcccta ctgcgcgtga gcagacggcg 1200 
cgcaccacgt tcaattrccg gccagtccac ggctnccggc gcactgacaa aatcaccata 1260 
ttgcgcctcc gcctgggggg cggcgcggct gaacagcagc aggcattccg gatcgtactc 1320 
cggatcgggg ccgcagggtt gttccgcgct taccggttta gtcagggaca tgtccatatc 1380 
atcactctca gtgggccaag ccgtgctcag gttcaaaaat catgccggcc accggacggt 1440 
cacgcacggc tttacccgcc caggccgtcc agccaagccg ctgcgcgccg ccgatcaccg 1500 
caggcggtgc ggcatgaggc ttcaactgca gctcaatttc ccagcaacat tcaaaaccga 1560 
tgaacgtacg caccagttct gccagcacgg gcagattgtt gccgccgggc agaaaacgca 162 0 
ggtaatcctc cagcgtgagc gggccgataa ttagccggaa tttgtactgc atatccggta 1680 
cggcccggcc gataagcgcc .ccgttgccca gcaccgaaga cngcgcgggc c-gcccagac 174 0 
gggcgacccc atcgtccgcc accgttatcc agtgcagggc gaattcttcc accgcaaaag 1800 
gcacgctgaa atagtgcgcc agcgtggcgg ccagcccgcc aggattgcgc gatccgcgta 1860 
ccagatgggc ggaggctgcc aggcgaacat gatctgacag cgggcttccg gcgctttccc 1920 
gtagaccctg cccgctgaga ctggcgatat aaaacgcaaa acggtcgtgt tccggtttgt 1980 
ccagcccgcc accagcggac tgggcgctgc gccatgcctg ccagaactgc gtcagccagc 2040 
ggcggcgaaa aatattggaa aaatgaacca gcgtcggatc gtgacgactc tctgagcggc 2100 
tcagtgccag ctcggtatag tgcagcggca acgggccgtt tggcccccat agtccgaggc 2160 
tgtacaggct caggtgcagg cgtccaccct gccagctgac ctgggcgatt tcccgtggcg 2220 
caaaggtcat cgtcggcgtc tgtcccagac ggaatttttc catccgcggc ttgccagata 2280 
cttgcctgcc ggagtaccac agagctgggc atccacgcgc cgcatcaggt tcaggaatcc 2340 
gcaacgccag ggggtctcct. gcgcctggtt gagagcgccg gtcacacgct cccccgtcgc 2400 
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ccggttctga ccggccaggt catgacatgc ccgcgctgca tcgagcgcag cgccatctgc 2460 
gagaaggcat taatggaaac atggcgggca atatagtgtt ccagcaccag accgaacagg 2520 
taaggactga taccggaaaa tccttcttcg tccacggtca gttcgcaact gacgccccgg 2580 
ccatagacca acaggccgga gccgggcagg cggcgggtca ccggggtggt tttgcagcca 264 0 
atcaggctgc gcacctggcg cgactgcggg ctgtcgtgag ccgggacaaa cagatccagc 2700 
agatcgcgca gcgcctggcc gccggtgcgg tgatccagat cggccagcgg cagataatta 2760 
aacgacaact gccggaccag ccgccaggcc atttcgcgtt cagccagcgg cggctgcggc 2 82 0 
gggcgcggcg gtctgataag acccacgccc gccaccggaa tcgccgcatc tacggccaga 28 8 0 
tcatcccggc catcacgtgg aataaggcag ggcagaccgc ggttagtcac cattgccgtg 2940 
acggtgatat ggcgcagatt ttccgggtag ggcgcttcat gctgatcaac cagcgagagg 3000 
aagacttccg agccggtaca gggggttcgg gcgccacagc ggcgggcgtt ttctgacgag 3060 
cggcgcggtt cacgacgcag tgaaaaataa cgcccgcggt tgccrccgtc actatcacgg 3120 
gtgtgataca gcggacgaaa aaccatcttc cgtgtggttt ccgcttccag gccttcgact 3180 
tcctgaacag aaaacacctc gtaatccagc ggacgggcac gatccaccac cagatgctgt 3240 
tccgtcacgc tgtgagtgac ttcaatccgg gtggtggtgc gaggaagcag gttgaccacc 3300 
ggcgtacaga acaggccgaa ctgtgcagcg tccgtctgat gaatcagcca gcccggcggc 3360 
aggcggttaa gcagtatgac aatttccgcc acattgccct gcaccttttg taacccggca 3420 
gacaatccgg tcggggtgaa gaagtaaaac cgttccggac aggcgaaaaa ttcatgcagc 3480 
agattatggc cgtgaaacac gttccaggcg agcggtagca gcccctgccc tggctccagc 3 54 0 
ccctcgtgcg ccaccgggng ttgaagattc acattcagt: cgccgccaaa gtgacczgggt: 3600 
tcaccggcca gtgcggcgac ggcgcnggta tgcagcagcn caaacaggtg tgacgcaacg 3660 
cgttcttcgc cgcagaggca aaagggcagc cgtgccggac cggccagctc gctgaaagtc 3720 
agttccccga aggttcgcag ggcgatgcgc aatgccccgg cgacatgaat attaggcggc 3780 
agatagcggt gcagggcggg catatccggc ggcgcggcgg tcaggcgcac ctcctcgatg 3 84 0 
gacagcggcc acagcgtgac gtcctggctg ccgcgaaact ggcaggcggt attttcgcct 3900 
tccgggatgg gggaaacgaa cgcggtatcg cgcggcacgg tgaccccttt cgccaggccg 3 960 
ccttcctgcg catcgggata cagctttacc actgccatcg atggcgtggg ggcgacgtaa 402 0 
ttggggctga cgacttccag taaccgctgt gtgaagcggg gaaactcggc gtcaac tttt: 4080 
agccgagtgc gggcgctcag aaagctgaac gcctcgatca tgcgttccac atacgggtcg 414 0 
gcaacatcgg ttccctgcat ccccagtcgg gcggcaattt tgggatggag ggtggcgaac 4200 
tcagcaccgg tctcccgcag gtagctcagc tcgcggccgt aacactccag tagccgcgga 4260 
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tccatgaata 


acgccctgta 


ccaaaagaac 


ctgcgtacca 


gaaactccgt 


gggatacggc 


tgtagcgtgt 


tatagctacc 


ccgccgggnt 


gcggcgttca 


gccggggttc 


gaaacggata 


tcccacctat 


gctcatacac 


aaagctgccc 


gacgccgccg 


cctgcggata 


gcgccgggcg 


agcaaaaagg 


agagatcccg 


gcgaatgacc 


tcccggccaa 


ttttctgata 


cggagcattg 


aggtggttag 


cgggtgtaaa 


acgagacgtg 


aggtacaatg 


ggccatgtcc 


agcaggctga 


gccccagcgc 


ccgcacggcg 


gtzrtcgccgg 


gacgcagggc 


gtcggatgca 


ctttccgaac 


gcgcatcgcc 


attcaccagg 


gtgatattaa 


tcggttgcgg 


cgattccagc 


gagcgtattt 


cggtgactag 


ctcaagtacc 


gggccaaggc 


cgccgccgtt 


ccactgcccg 


cccgtgtctg 


tatcaacctc 


accggtatcg 


tcatgacagg 


cgggctgcgg 


cagaagaaaa 


ccgggtcgtt 


cttcgcagcg 


aaccagctcc 


cggtacagcc 


tggcgcatag 


ccgaagctgg 


tgtagcgccc 


gaaacaggct 


gtggcggcag 


agcgcttccg 


tctgaatccc 


ctccgcaata 


gagcattccc 


tgtcagtttt 


tctcatgggc 


ggtatttcca 


ctatcgatgc 


caataatacg 


atgctcgcgg 


gcggttttcc 


cccccagttc 


cggcgagagc 


yccagccagu 






ccgggcgcgc 


ctgccaccat 


atcccgtaac 


gacgtcaaat 


ccngggtgac 


agcgtcttca 


tcttcctgcc 


cggagggccg 


caatgcctgc 


ctacccgctt 


catggccagt 


gtaagaggat 



PCT/SEOO/01079 

4 

gcaatgcggc tcagctccac atccagcgcg 4320 
tgcgtcagaa tttgtccgcg aatttcaaac 4380 
ttatcgagca agggcgtgac cctgagtgtg 4440 
atggcgcgcc ggatcgcccc gcngatatcg 4500 
gccagcggcg gcaggccata gttgagcact 4560 
tcgatgtcac cctcgtggct aatggtattg 462 0 
tccttcagtt gtaccggcgt: gacgctgata 4680 
tcacacagcc gatcaaacag cgcgggcagc 474 0 
ctcatgcgcc atcgttttcc tgagcatgaa 4800 
catcgccgtg gctggtcagc cacactttct 4860 
ggccgccctg ccaggcggcr tcccrgcaca 4920 
cgctgtaacg ggtaaagagc caggcgccgt 4 980 
cgggtttcca cagcagatca gncaggcgcg 504 0 
gcgaaaacgg cagccagata cacacgccgc 5100 
gggaaccgct gtcgctcgcc cagtcaaatg 5160 
ttatggcttc cagtgcggca ttacggtgtt 5220 
cgagcgccgc cagcagtgac tccacccaaa 5280 
gttcaccctg aaaaacggtg tggcggaaca 5340 
gggcctcctg ggtataatcg gcctccatcc 5400 
gcgaccagtc cccggccaca cacagcaact 5460 
ccggatcttc ccgaacctgc tgctccgcca 5520 
gcatcagcgc ggacagggta gcaggaagcg 558 0 
tttttctgtg tcggagtgat tcggtagtgg 5640 
cgggtcagat cgggcagtat ctcgctatgt 5700 
agttgcagga tatcgggcat tgatcccatc 5760 
gtgtccagcg tatccagaat ggcatcaata 5820 
gtgtcggtgt cgcccttttt atcgtataac 5880 
tttcggggaa acggtttagc acggaggggc 5 94 0 
tgatattcct ggcaaagctg gtgtagtgtg 6000 
gcggtttccc ggacgggaat aatatcgaac 6060 
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gggttccgcg gattgagctg ttgttgttna aaccagtcca gacccaggta cccagcaacg 6120 
gattcagatg actgaaccaa ctgaggaaac gaaacatcgg gacgggactc atcattggtg 6180 
cgggcaaggc accaggatga cagaccccat tcaatcgtat ctccatcatt cagccgcata 624 0 
cggtgatgag gttccatcac ctgctcattg acggcacagc acagtccacc agagtgatta 6300 
acaatccacc acgcctcccc gtggcgaaca agactgagcg cgatatcgtc tttcccatgt 6360 
tgctcaaaaa gcggaatgta gggggcggcg gcagtaaaca cgatgtacgt ccctgcggga 642 0 
tactgtgcgt tatccgttat accctgcgac tgtattttcc gtaatcccca catgtcttgt 6480 
ttcattatgt cccttaaatg fcatttatttt tctggaggaa acgtttaggg agttttaatt 6540 
cataaataat tctaactaaa ttcataggga gccattattg atgacacccc ttttactatt 6600 
tccgtcgtga atgcattgtt gtatgcatag atgtcttttt tgaaacatta tttcttttaa 6660 
tcccgcaacn gcgactttta tcctgctgrt: cattgattat ttaatattaa cggcntagct 6720 
gctaactagt ccccgatctg tatcaccgtt: ccgtttcgat acttcctcga ggctatcaat 6780 
aaagaattgt ttatatatct atatgcattg atgcattatt tttatgaatt tttatgtcac 6840 
aaggcacaac acatggaaac tcctgtttca cgcagtgcgt tgtatggaaa actggccggc 6900 
ccactacccc ggtcgctgga atcggcaacg gcattttgca aactacgctc taatccctgg 6960 
gttgagctga ctcactggct gcaccagtta acacagcagc ccgataacga tattctccac 7020 
gttcttcggc attaccagat ccctctttct gatgtggaga aagcgttact ccggcaactg 7080 
gatatgctgc ccgccggggc cagcgccatt agtgattttt ctcaccatat cgatctcagc 7140 
gttgaaaagg cctggacgcc ggcgagcgtc cgttacggcg ataacaaaat tcgcagcggc 7200 
tggttgctgc tggccctgct gaccacgcca gaaccgcgtc gggtactgag cagtatctgc 7260 
gcgccgctgg ccacgcttcc ggttgatgaa ctgacggaaa tactgccctc gttgatcgaa 7320 
acatcgccgg aagcgcagga gcgcccttac gacggctccg gcctggcatc agccattccc 7380 
ggtgaaagca gtcaggcgat tcccaacggc gggcaggacg gtaaatccgc gctggcaaaa 7440 
tactgtcagg acatgacggc acaggcgcgc gacggcaaaa tcgacccggt gacggggcgt 7500 
gagcatgaaa tccgcaccat gacggatatc ctgctgcgcc gtcgccagaa taatccacta 7560 
ctgactggtg aggcgggcgt cgggaaaacg gcggtcgtcg aaggtttcgc cctcgcgatt 7620 
gcgcaggggg aagtgccgcc cgcgctgcgg gaagtacggc tactggcgct ggacgttggc 7680 
gctctgtcgg ccggagccag catgaaaggc gagtttgaat cgcgtctgaa agggttactg 774 0 
gaagaggccg ggcgctcgcc gcagccggct atcctgtctg ttgatgaagt tcacactctg 7800 
gtgggcgcgg gcggcgcacc cggcacgggc gatgccgcta acctgctgaa accggcgcng 7860 
gcgcgcggca ccctgcggac tatcggcgcc accacccgga gcgaatacaa gcgccatatt 7920 
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gagaaagatc 


cggcgctgac 


ccgtcgtttc 


atccccgcaa 


tggaaacggt 

3 3 3 3 


gcgtggtctg 

3 WW 3 


ctgattctgg 


atgaagcggt 


acgtgcggcg 


cacjcaqt cqc 

*— 3 3 ^ ***3 w 3 


eggataagge 


catcagcctg 


acactacaca 


cgccgcctgc 


cagegtacaq 


a t qct a. acoot 

CL w *j n> v *J 


cgctgt tgea 


qaagcaqcxaa 


era t ac a ctaa 


fc crcr c Q c oaa t 


tttctcgctc 


t era c acr c cr qq 

*-33 3 333 


agctggaact 


ggtacatacg 


actoataaca 

w ^ W ^* *■* 


aaaccacgct 


gcaacaggcc 


qcqccqqtgg 


tgttcccgga 

^ 3 3 


agtcaqcgcg 

— ' — • 3 3 


aceggtat t c 


ccqccqggcg 

3 3 3 3 


catgatgaaa 

3 — » 3 


acccqactqq 


cgcaacgcgt 


tacegggcaa 


attcaaaccq 


ecaaageggg 

^333333 


actgggeqat 

— *— ""333 3 




ccggtqtcgg 

^33 3 3 3 


taaaaccgaa 




agaacctggt 


aaccatcaat 


tccacqctqa 


aagccgcgcc 

33 3 3 


qcccqqctat 


qaaqcqatgc 

3 3 33 3 


gtcgccaccc 


ctggagcgta 

3 3 3 3 


catgacgtcc 


acgaactctc 


ctatcaggtg 


agtacacatg 


tcgat ttcaa 


aaacaccacg 


cccatcagcc 


agatcjtgcqa 


agacccggcc 


Qcactaatcrc 


eggaattgeg 

33 w 3 ™ 3 


caagcatttc 


atcccttacc 


tqccgctgga 

3 3 33 


egaaaegteg 


cqactqgtgQ 

*"*33 w ""33™33 


cgeggatgag 

3 3 3 3 W 


tgaacagcac 


gtegcacata 


ttgtggcgtg 

3 3 3 3 3 


ctgtccaatg 


cacaccgaac 


agcacattct 


gccacgactg 

3 2 ™~ W 


aaagecgega 


tcaggcagat 


tgatategge 


ategtttget 


gaaaccggcc 


gttcgaagtg 


tataccgctc 


cccttgcggc 


aaccagttga 


aacaacagcg 


cgcagaaatt 


catcgcgcgc 


gaegtagaga 


tttacggttc 


cgagaaaaaa 
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caggtgttgc agactgeega aceggaagag 7980 
gtggatacgc tggaaaaaca ecataaegta 8040 
gtacagcttt ctcaccgcta cattcccgcc 8100 
ctggataccg ccgcggcccg cgcggcgctg 8160 
tccctgcgcc agcagctaaa ageggeggaa 8220 
aaaatgggga ttcagtcaga tgageggege 8280 
aacaatgaac tgactgeate cgaatcccgc 8340 
ttgeaggaac tgcgtctcgc agagtctgat 84 00 
gaaaeggege taagggagtg geagggegae 84 60 
gcggttgtcg eggegattgn cgccgactgg 8520 
gatgaggeca gccaggcgcr ggaaccgcct 8580 
gacggcgcgc tggegcagat tggtgaacgt 8640 
ccacgcaaac cggtgggcgt gtttacgctg 8700 
accgcgctgg cgccggcgga ggctatctac 8760 
aegagegagt tccaggaggc tcacaccgtt 8820 
gcgggctatg gcgagggtgg cgtgccgacg 8880 
gtgetgeteg acgagatcga aaaagegcac 8940 
tttgacaagg gtgggatgga ggaeggegag 9000 
ctattactca ccaccaacgt gggttccgac 9060 
ttaacgcccg aegctaeggg geccaaagag 9120 
ccggcggcat ttctgggccg cgtgacggtg 9180 
cgtggcgtga ttgcccgtct gcacctcgac 9240 
ggcgtgacgc tgaegtatag cgaggaactg 9300 
catgaaaegg gcgcgcggtt getgategge 9360 
tcgcgctact ggctgeagge catgaeggaa 9420 
gttaatggtg atgagcagat tgttttcgag 94 80 
tccgtagtgc gattttaaaa actgtacegg 9540 
ctaaaaagga aatgaaggac tatggctacc 9600 
aaccgcgcgc cgcgcgtgca gattgaatac 9660 
ategagctge cgttcgtgat ggeggegeng 9720 
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gccgacctgg ccgggaaacc gcgcgaagaa ctgccgccgg tgacggatcg caaatccctc 9780 
gatattgata ctgataactt caatgagcgc atgaaagcca ttgcgccgcg cgtggcgttc 9840 
gctgtgccga atacgctgac gggtgaaggn cagttgatgg tcgatatcac gctggaaaat 9900 
atggacgact tttcgccggc acagattgcc cgcaaggtgg acgccctgaa ccagtcactg 9960 
gaagcccgca cccaactggc gaacctccag acctacatgg atggcaaggc gggggcggaa 1002 0 
aatccggtca ataaaccgtt gcaggacccg actctgctga aaacgctggc gaatgcgccg 10080 
aaatcggccg ctacccagca agatgtgtca gcggataatg aatcagcgga ataacgtcga 10140 
atttttaagg aattttcatg gcaaacagta atatgcaggc aaccgacgcg gttgcccagg 10200 
ataccgcctc cgcatccggt gaatttgatg cgttgctgaa tcaggccttc cgacccaaga 10260 
ctacccaggc ggcaaaagcc gtggaagccg cggtgcagac gctggcgaac acgancaccg 10320 
tcagcgatga cgcctataaa agcatcaccg ctatcatcgc gcagatcgac rctaaaccga 10380 
ccgaacagat caaactgatc ctgcaacacc ccgactggca gaagccggaa tccicgtggc 10440 
gcggtatgga gcatctggtt tacaacaccg agaccgacga aaagctgaaa attccctcca 10500 
tgaatctgtc aaaagatgaa ttgcggcgca acatgaagcg ttacaagggc accgcctggg 10560 
atcaaagccc gatgttcaag aaactgtacg aagccgaata cggccagtta ggcggcgaac 10620 
cttatggctg tatcattgcg gattactact tcgaccatac accgcccgat gtggaccngc 10680 
ttggctctat cgccaaagtc gccgcgtcgg cccatgcgcc gtttatcgcc ggggcttccc 10740 
cctcggtact gcaaatggac tcctggcagg aactggcgaa tccccgcgac ctgaccaaaa 10 80 0 
tcgtcaccca gaacctggaa tacgcgccgt ggaactcgct gcgggctagc gaagaccccc 10860 
gttataccgg cctgacgacg ccgcgtcttc ttgcccgcct gccgtatggc gcaaaaacca 10920 
acccggngga cgagtttgat tttgaagaag atgcggatgg ttctgaccat accaaacacg 10980 
tctggagcaa cgcggcctac gcgatgggcg taaacatcaa ccgttccttc aaacactacg 11040 
gctggtgtac gttgattcgc ggtgtggaat caggcggtgc ggtggaaaat cttccctgcc 11100 
ataccttccc gactgacgat ggcggcgtgg acatgaaatg cccgaccgaa atcgccatct 11160 
ctgaccgccg cgaggctgaa ctggcgaaaa acggttttat cccgttgatc caccgtaaaa 11220 
actcagacta tgccgccttt atcggcgcac agtcgctgca aaaaccacag gaacactacg 11280 
atccggacgc gacggccaac gctaacctgt ctgcccgtct accgtaccng tccgcctgct 11340 
cgcgcttcgc tcacttcctc aaatgtatcg tccgcgacaa aatcggttcc tttaaagagc 11400 
gtgaggatat gcagcgctgg ctaaatgaat ggattatgaa ctatgtcgac gccgatccgg 11460 
tgaactcctc gcaagaaact aaagcccgtc gtccgctggc tgccgctgaa gtagtggtgg 11520 
aagaggtcga aggcaatcca ggttattacg acgcgaaatt ccccctgcgt ccgcatttcc 11580 
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agctngaagg gctgacggga tcgccgcgcc tggcgacaaa actgccgtca gtgaagcagg 11640 
gcaatgcctg atatatattt tgtgaatgtt taagcgagtg aagtcagaga agatagagaa 11700 
tataaagagg gatatgaaga aaagaatttc gcctcgccca cggtctcgta aaggtggggt 11760 
acgtaatgat gacacatatc cgaatgccag taacaatgcc gaagcttttt atatcattga 11820 
gtaggaaata catattatga ccataagccc aacttttcat ctgttacccg gtattgttct 11880 
gctcttttca caatatgctg tagcctggga agtcagttgc ccggctgcta ttgataccca 11940 
gtcttctgct gtgagcccga agtctgatgt cccagcggcg tggcagcttt ctccccgata 12000 
tatgtcgcgt ttatggttaa gtagtattgg ggtaacgcag ggtaaacctg aaaacctgat 12060 
ggatctcaaa ccagagacta aaaaagtaaa cggtgaaaat tggtcfcgtat gggaaacaga 1212 0 
acgtggtagc gataaagaaa ccgaccgcta ttgggtttcg tgtatccatg gtcatgaaca 12180 
gatatggrtg acgcaaccaa tacctgcttc tccractcgc tgtaagactc gtaatrccga 12240 
gggaccgcca gaagaccagt ctgtatcttt tatccgtaat tagcgatctg agacgtgaaa 12300 
atttcagtac aggttatggt ttttatcatc ggaagctatg aagcattatt tatatgcatt 12360 
aaataatgca aattcataaa ataactaaac acattatcgg taccggaaaa atatacagtc 12420 
ctctgttctc ctgaagttat tggagaagga ttctgcacgg caatgattca tctataaaca 12480 
aaaagatata gataaaatca ggtttatttt aagtaaaact taataaggat ataaaaatgg 12S40 
cttatgacat ttttttgaaa attgacggca ttgacggcga gtcaatggat gacaaacaca 12600 
aaaatgaaat tgaagtactg agctggcgct ggaatattca tcaggaaccc accatgcacg 12660 
ccggtagcgg cctcggctcc ggcaaggtct ccgccaccaa ccrggatttt gatcactata 12720 
tcgaccgcgc cagcccgaac ccgttcaaat accccgcctc cggcaagcac actccgcagg 12780 
ccattctggt tatgcgtaag gctggcggca atccgctgga gtacctcaag tataccttca 12840 
ccgacctgat tgtcgccgtg gtttccccga gcggcagcca cgatggtgaa atcgcctccc 12900 
gtgaaacggt ggagctctcc ttcagcaccg tgaagcagga atacgtggcg cagaaccagc 12960 
agggcggcag cggcggcacc atcaccgcag gctacgactt caaggccaac aaagaaatct 13020 
aacggctgtt tttccggcca gatgttatgt ctggctggtt ttattgtttt gattctaaag 13080 
gaatttacag tgaataaatg gcgtaacccc accgggtggt tatgtgcggt agctatgccr 13140 
tttgcaccgc tcctgctttc cggatgcggc agtagcgatt cgctacttga cccctaatcg 13200 
cagcggcctg gcctgagcgc gaaagcgttt tacaaggtga attctgacaa tcagaagaaa 13260 
gcggcgtcca tgaagatacg cgtcgagaat taatgaccta cacagaaict ttagaggtra 13320 
agcaaaatga acagaccttc attcaatgaa gcgtggttag ctttcaggaa ggcgaaccat 13380 
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tccgtcgccg 


atgtgggtag 


cattattggt 


tattttcaaa 


atgcctgccc 


cattcgaatg 


acagcccgta 


actctccgta 


tgcaaaggtt 


cgcgtgaatg 


atatgattga 


ttatcttact 


aacaatccga 


aacagagtga 


cttcatcggg 


ggctggagca 


atgccagagg 


acacgttaca 


tgccacttat 


taaatgaccc 


agataatgga 


atactgccgt 


gaaatggtta 


atattagtta 


agccatcctt 


tgcacaagaa 


gcattgacca 


attgggcgct 


gagtcattgt 


ctggcattag 


ccagagctac 


ggccagtgct 


tacctcgaac 


aaattgacga 


gattgcgaaa 


aaatattcag 


atcttaatac 


catgaagtgc 


atagatttta 


aaaggcgtgt 


cgagaagnaa 


aattcaaaga 


gaatttatgg 


gtaagaaaga 


gctgtacagg 


ataaatcagg 


gcttaattta 


ggtagttaaa 


ctgaaaactg 


acggcattga 


cggcgagtca 


gtactgagct 


ggcgctggaa 


tatccatcag 


ggttccggca 


aggtctccgt 


cactaatctt: 


ccgaacctgt. 


tcaaatactg 


cccttccggt 


cgtaaggctg 


gcggcaatcc 


gctggagtac 


gcaatggtat 


cgcccagcgg 


aagccaggga 


ctctccttca 


gcaccgtgaa 


gcaggaatac 


ggcaccatca 


ccgcaggcta 


cgacttcaag 


cggccagatt 


tatatctggc 


cggatttatc 


gaatggcgta 


accccactcg 


gtggttatgt 


ctttccaqat 

w w ^ w> ^5 ^3 


Qcaqcaqtaa 


CQatQCQtta 


agcgcgaaag 


cctccgataa 


ggtgaatcct 


atacgtgttt 


acgaactgaa 


aaatgacgcc 


catgacaacg 


acaaatccgt 


ccttaccgac 


cgtcccggcg 


aagagaaaaa 


actgcgtcgc 
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ggaaacgttg ggaaaaatat aactggtggt 13440 
agctatgttt tgaacgcgac agggttccca 13500 
agtggtgccg ataacaaatt ctatacttac 13560 
catactatgg gcaagcccga tcttactgtt 13620 
aagaaaggaa ttatcgcagt aaaagggcat 13 680 
ttatggaatg gcagtatctg ttcagatcag 13740 
ccatttgttc ctgaagttgg gacactgtgg 13800 
ctttcagcat aagtggaatg ctggtatggc 13860 
cacaatattc acagtcggaa ctcttaaaaa 13 920 
tatacaaaga tgatgtcgct aaaaacgatg 13 980 
anggcaaaca atctgtggag atctaccacg 14040 
ggttgaaata taacggtccg atatcatcag 14100 
tccatgacag ggaattaaat gaattaatta 14160 
tattaaaagc atacgttctc ttgctctgat 14220 
aatagttaat ctgtccacct aataaagcag 14280 
aggacagtag atatgtctta tgacattttt 14340 
atggatgaca aacacaaaaa tgaaattgaa 14400 
gaatccacca tgcacgccgg cagcggtctc 14460 
tcatttgaac attacatcga tcgcgccagc 14520 
aagcacattc cgcaggccat tcrggctatg 14580 
ctcaagtaca ctttcacaga tcrgattatt 14640 
ggggaaattg cgtctcgcga atcaattgaa 14700 
gtggtgcaga accagcaggg tggcagcggc 14 7 60 
gccaacaaag aaatttaacg gctgtttttc 14820 
attttgattt taaaggaatt tacagtgaat 14880 
gcggtagcta tgccttttgc actgctcctg 14 94 0 
cctgacctcg aatcacagcg actcgacctg 15000 
gacaaccaga agaaggccgc gcccattgag 15060 
gctctcacga cagctgatta ccggtcgctc 15120 
gatttagtgc gtcgcgacag crttatcttg 15180 
ccgctgaacg cgcagaccac ggcaatcggc 15240 
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gtactggccg gataccgcaa cctggccaaa tcggtctggc gggtaaccta caaaaccccg 15300 
gaagccccgg aaaaagcctg gtacagcagc tc cat at egg ggaaaggaaa agtgcagttg 15360 
gaggeggaac tggaacaaag cgccattgta attaeggaac gggataaatg aattatgagc 15420 
tggaatgacc gcgtagtctg gagtgaagga caatttttac tgeegcagat gtttcagcag 15480 
caagagegtt atctggaaca egtcatgeat taccgcagcc tgccrgctgac cccctttttc 15540 
tggggattca gccactacaa tattgatggc gaagcgctga acateggtaa actgatactg 15600 
aaagaggcat cagggatttt tcctgacggc aegcegttta acgcaccgga ccacaccccg 15660 
ctgccgccgc cactgaccat tetgeeggag cacctgaacc agcagatttg tctggcggta 15720 
ccggtacgcg cgccgaacag cgaagaaacc aegtttgaca ataaccegga atcattggcg 15780 
cgcttctcgg cacacgaaca cgacatccgc gacgccaact cgctgggacg tggcgcgcag 1584 0 
ttattacagc tcagccatct gcgcctgcgg ctgctgccgg aaaaggcggt gacgggcgcc 15900 
tggattggcc tgccgttgac ccgcaccacc gggttgaacc ctgacgggcg gacagatatc 15960 
gaccacgacc tgatcccgcc catcattaat tatcaggeca gttcactgat gtgtacctgg 16020 
ctgtcgtgga tcaacgatct catceggatg egggecgatt cgctggcgga acggctgacc 16080 
ggcagegaca accacggcca tgaagcagcg gaggtctccg attacctget getgeaaatt 16140 
ctcaatcgct ttgagecget gctgactcac ctggcgaaaa ccccgctggc cccggaggtg 16200 
ctgtaccgct acctgtccga actggccggg gaactctcca cctatgtgcg tccacaaacg 16260 
cgacggcccg ctgaatacaa agagtacaaa cacctgacgc cctatgccgg gttgaaatcg 16320 
ctggttgatg aggtgcagtt cctgctgaac geggtacega tccggggcgc gcagcgcatc 16380 
gagctgaaag aggggactta cggcatcctg aatgcggtgg tggccccttc cgatcttgcc 16440 
gatttcagca cgctggtact ggcgataaag gettcaatge cgaccgatgt getactgeaa 16500 
cattttgecg cccagaccaa aategggeca tccgatcgcc tgeeggaact gatccgctcg 16560 
catctgccgg ggctggcttt gcaggttctg cctgtaccac cgcgccaaat cccgcttcag 16620 
geeggataca tctattacga catccgccgc gagggagcat tgtgggaaca cattgcccgt 16680 
tacggcggga tggecatgea taccgccggg gaatttcegg ggctggagac agaactgtgg 16740 
ggagtgcgcg ataaatgaca gacagtaccc tgacgccgcc ageggeggat atgatgtcct 16800 
ttttgtccac cacgccggaa cataaggaca gtgaatatga aacgeeggta cacaccagcc 16860 
agegcaegga actcaatgtc atcgttgaag aeggtcegga cagcaaaccc cggctggctg 16920 
aaatcagege ggeggctaac ccgttgctcg ccgctgcccg gectttattg tgcgctctcg 16980 
cagccatgcc cgctaaactg gatgcggccc tggtagagee ttaccgtaat ctgctggtac 17040 
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gcgagatgca tccgtaccag acactatgcg atcaggcgaa cctgcggcgc gagcacgnac 17100 
tggcggtacg ttactgcctg tgtacggcgc ttgatgaagc cgccaataac acaacctggg 17160 
gacggcgcgg cgtctgggcc ggaaaaagcc tgctggtaac acttcatggt gaaagcgaag 17220 
gcgggataaa acttttccag accatcgggc gtctggcggc cagcttccag gagcacggca 172 8 0 
acgtactgga ggttatctac cacctgctgg ggtcgggatt tgaaggccgc tacagcgtgc 1734 0 
agccagacgg gcgtaagcaa ctggacaata ctcgccagca actgctgaca cagctttcac 174 00 
agcgtcgcga tccggttatg cccgcgctct cgcccgactt tcagggggcg ataagcggac 17460 
gactgcggcg gatgcgccgg gtgccggtct ggctgagcgc cgggatagcc ctgttggcga 1752 0 
tgctgacgct gtttggcctt tacagccacc ggatggatgt gcagaccgtc accgtacaac 17580 
agcatattga tgcgattggt ataaaactgc cgccgccgcc tgtgccggtt cataagctgc 17640 
ggccgaaaac cctgctgcca aacgaaatcg cccgcggcct gctgaccgng gacgaagacg 17700 
accagcacag cagggtggtc trccgnggcg acgccatgtt tgcgccggga cagaaaacgg 17760 
tgagtgacgc aatccggcca gtgattaaca aagcggcgcg ggaaaccgcc cgcgtgggcg 17820 
gcgcagtcac tgtaacgggc cacactgaca gccagcccat tcattcggct gaattcccat 17880 
ccaaccrggt actgtcggaa aaacgggcgc cggaagttgc ggcgttgctg acctccggcg 17940 
gcgtacctgc cggacgggta catatcgtcg gcaagggcga tacggtgccg gtggcggata 18000 
acggcagtaa agccgggcgg gcgaaaaacc gtcgggtgga aattctggta gtggagtgaa 18060 
tgaatgatga aaaaatcaac ctatgacgtg tctcatcatt cggcagtatg tggcgtgacg 18120 
ggggattact atcggatctc agcgacatat cacataacac gacctgttcg tgtcrttttg 18180 
atcaccttat grtgcccgct: anccggcggc gctttcgccg gacccccgat taacgcagga 18240 
ttcatttccc ccgataatgt caacctcagt actcaggatt tcctgaaact ttatgccact 13300 
gacaacgtac agaaaaaaga caatgcactg atgtatatgc tgggggtgga ggatgcgaca 18 3 60 
gaaggtaaag cctggtgtgg atatggccag gttgacagta taacaataaa ccatactgtg 1842 0 
ctgacccggt ttgaacagca cgcagtgaaa aagcctgatg taagggcttc aacaccaata 18480 
gaggaagcat tagttaaaaa ttttccctgt cagaggacag actcctccat aaaaactgct 1854 0 
tcccggtcat ctcccatttt atccctgacg ccggatgcgc ttaatctttc aggtaatgac 18600 
ttttttaaat tttgggtgtc tggtaatcaa cgggataaac tcagggcggg tgcctatctg 18660 
ctcggcgtgg aggatgcgac agagaacaaa ctgtggtgtg gatacgcttt atttaagacg 18720 
ccaacattaa atgaattagt ctatgtttct cttaaaaata aaaccaatga ggaactgaat 18780 
tctcgcgcgg crgaacttat cataaataaa ttaacagagt atccccgtaa tatataaaat 18840 
cantcaagtt gcaccaaggc ggcaagggag tgaacccccg ggagcgtaca ttagttcgcg 18900 
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actggggtga gcgaggaaag ccaacgcaca tgcagcctga agtacgacag gtataccctg 18960 
tgataatgtt atcgctgcca gttcagataa agtcttggtg gtaaagtctt ggtggtaaga 19020 
tattgatatg tttaaggtaa tggtctgtct gttggtgggt gttcccgcca tttcttacgc 19080 
gcacgattat gggtgtgcca cggtnggagc atcaatggag tcatcattat ttgatgccat 19140 
aaaaaatgac ctgaatatag atgtcgctac tattataaag gataaaacaa aggtagagat 19200 
tcttgatatc ccacccgtat ctaaggtcta tgcagaatct ctggccagga tggattatga 19260 
aaaagataag gccaaaaata aagcggcaat attagataaa aaatcctatt tcgatagtta 19320 
ctatgaaaat caggttaaac ctatcgtggc aaaatatacc tatattaaca aagataaaga 19380 
aaaagatatt ttcattgcat ccagctccat gaatgctgat gagtgttctg taagatttaa 19440 
tggctatatt actttatcca gggaatttta aaacattgac tattgggcgg cgagtcgaaa 19500 
tataatgcaa aaatttctta gtctgctttt ttcccggcgc gcgccggcag crgcgggcgr 19560 
tccggttctg gcgctgctgg tctggtttgt cgggccgccg gtgtcatttg acaccctgcg 19620 
cccgctggcc tccgtgggta gccgggcagc gaccattgcc ctgttgctga tgctgctggt 19680 
actgtggctg gtcaactggt cgatgagtat catcggcatc agtgtcctgt gccrggcgat 19740 
tggctccgtc acaccgctgc tggccctggg cgatgtccat ccgtttgcgc cgctgtgggt: 19800 
ccgcctgacc ctgattggtt tcatcctgct gatgtacgcg ctgtacggcc tgtaccggc: 19860 
gtggcgtgcg ctgcgtatgg atgaacaact gctgcgtcgc ttcctgcatc cgcgcgggga 19920 
agaggtaccg gtggcaggcg agatcaaagc cgacctgcgc accgtcaacc atattgtcac 19980 
gcaggccatc cggcagcrgc ggcagtcgcg ggtggatacg cccggctggc gcaaaatct: 20040 
cgagggaaaa cgctttctgc atgagccgcc gtggttcacg gtggccggca gccccggcga 20100 
cggcaaaacc acggccctgc tgaacaccgg attgcagttc ccgctggcgg agcaaacgga 20160 
gcagacttcg cgcatcctga cagtaccggg tggcggcacg ctacactgcg actggtggtt 2 022 0 
taccaacgaa gcggtgttga ttgataccgc cggacgctac gcccgccacg acgacggtgg 20280 
tgaagcgagc gccgcgcagc gcaacgccgg agagtggcag ggctttctcg gtctgctgcg 20340 
taaacatcgc cccggcgcgc cgcttaacgg cgtgatcctg acgctaaacg cggcggactt 20400 
aaccgcacag tcaccggcgg aacgcctggc ggcctgcgcc gctctgcggg cgcgactggc 2 0460 
agaactgcgc gagaccctgg ggattcgctt tccggtctat ctggtggcca ccaaaatgga 20520 
tttgttgccg gggttcagcg aatattttcg cacgctgacc agccatcctc gtgcacaaat 20580 
ctggggcttc acgttgccgc acagccgcag gcgaaaagcg ggcgacccgc aggcgctgca 20640 
cgccgcctgc gcgcaggagc tggcgcgcct gacgctgcgg ttggatcagg gactggatac 2 0700 
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ccggttacag gaagagtacg accttaaaag ccgccagcgg ctgtataccc tcccgcgcga 20760 
gttcgccgcc ctcggcgagc cgtngctgga ggctattgaa cagatcttcc tcgattcaaa 20820 
attcgatgcc acgcaacrga ataacacgcc gcgcggggtg tttttcacca gcgccgcgca 20860 
ggcgcaggcc gacgccgtgg ccgaccagtt gagtatctgg cagcgctttg tccgggcgat 2094 0 
aaaaaccgcc cgtggcgaat cctccgcctc cctcccacac gctctgccgg acggcaaccg 21000 
cagctacttc ctgcatgacc tgctgacaca gtttatttrt cgtgaagcgc acctggtgga 21060 
gccaaacctc cagtgggcct ggcgttaccg cccgctgcgc ctcggcgggc acctgctggt 21120 
accggtgctg gcattcctgt tgtggcaggg gatgcagacc agccagcaga ccaacggcga 21180 
ctatctgaat gaaatcagcg cccgcgccac ccggctggac ggtgatgtga aagcctacac 21240 
cggtaaaccg gcgatggctc ccgtcccggc actgctggac agcgcaaggg aactgtccgc 213 00 
ccggccggaa ccggacccgg acgcgccgcc gctggcccgg cgccacggcc tgtacagcgt 21360 
accgccggta accgacagcg cggcgucgct gtacaaccgt ctgccgganc aactgccgcc 21420 
gccgccgctg gtgaaacgga tggagnatgt gctggcggac gccattgccc gtcaggacag 21480 
taaagcggcc tacgatgccc tgcgcatcca tctgctactg aatctggata aagatcacga 21540 
agataaatac aacgcggcgg agatccagtc ccgggtgart: aacgatctgg ggaacagcga 21600 
cagcgtggcc gggttcggcg ggcgcgccgc cgtgctgacg catatcgaag cgctgtttga 21660 
cggcagccgg gtggtgcatt caccgtatga gaaagatgag gcgctgatcc gccaggcgcg 21720 
ggcattcctc gacggtcaca ccagtaccga gcgtatctac gcgcgggcgc tggcggcaat 21780 
ggagagcgaa gcgccgcagg agttcacgct ggtacgcgcc gtcggcgcgg atgcgggaac 21840 
ggtctttgtg cgtagcaacg gcgcgccgc- gcancggggc gtgccgggta tttctacccg 21900 
tgaaggatac cgggagctgt tcgacaaacg attaccggaa tttgcggcgg cggcgacggc 21960 
gaacgatggc tgggtgatgg gccgggagag tacgccaaaa aagctgactg acagcctgcg 22020 
cagccagata ccggggcagg agcagtctgt cgcccgcgaa gtccgccgtt tgtacctgac 22080 
ggaacatgcc cgccgctggc aggattttct ggacagtatc catagtatca acagcgccgg 22140 
ggaagagggc agttccggcc tggcctatga tttacaggtg ctgcgcaccc tggcgtcgcc 22200 
ggacccaccg ctgatgcggc tgggaaaagc ggtggtggag cagaccacgc tggtgccgcc 22260 
gccggacccg caggccagac agaaacaact ggcgcagcgc gcatccggta acgcggggaa 2232 0 
agtggtacag acggcaaaac tgccccagga tattcacccg gaagaacggc tggaaaaaac 22330 
gctggtggat gatcgtttcg ccgcgctgcg cgaggtcatt gccgggcgga cggacggcgg 22440 
acagagcggc ggtgggacga tgcagatcgc ttcgctgctg accatgctca acgagtatta 22500 
cacccaactg accattgccg atagcgcgcc ggcggcgggg acgttgccag cgcgcatcac 22560 
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toccacaqac 


aagttgcaac 


tqqaqqcqqc 


gaaactgccc 


QCQccactqa 


aaaacatcc** 


22 62 0 


actaoatctQ 


acqaaqcaqq 


qaacqcqcaa 


aaccaacgcc 




acqtqctqaa 


226S0 

A* A> W V V 


cracccacraCQ 


qaqacqatqa 

3 **33 3 ^ 3 *■* 


tQqqcQaccra 


ctgccgtgac 


qccatcqaco 


GGCGCtatCC 

^3 ^3 w w 


2274 0 


W w W w 3 W W *-* ' 


aqtccqcaqq 


aqqtcaqcqc 

33 3^3^ 


cgaggact t t 


aaccqcatct 


t CGCcaqcGG 

W 


22 800 


<w. <-* W t_ 1» 


aatactttct 


ocraocaaaca 


actaactccq 


c Caacaaaca 

^4 ^4 a a 


W «h» V_h ^ Ik^F d 


^ <L O U u 




tacaaaccoa 


ccaaaaacaa 


catnacatto 






^ ^ J ^ \J 


yt. »y^»»3 


acoaaocaqa 


tccacaacot 

V» W W W 


attttttaac 






^ ^ 17 O U 




atacaaatta 


acotoatoaa 


tataoacccD 


*H W^ V* GL w \^ C4 W W 


a a \» ^ y 3 w y cl l_ 


21040 


l„ U Q. L— Q. l_ 3 ^* W 


^3 W w *j w 


tocactacac 

w W ^ W fefc W 


W G% w 


3 » w "—3 w v». 




^. J 1 u u 


w "»* w 3 3 3 « 


cccrccicaacQ 


qctcqatqqc 


qqaaaccacc 


CICC3LCICCCGC 


Gtattcsrca 


2 3160 


W ^m* C-4 w Gl v W W %A 


acqctactqa 


uu 33 u 333 wu 


ctqqqcactq 

— L -333^-3 v -' *- 3 


tttcatctac 


tqgacgcccq 


2 3 220 




craaacQQCQQ 


tqcqqqqqcq 

W 3 W 33333 W 3 


tcaactggtg 


gaatacgact 


t ncracdati ca 

^2 ^» 


23280 




ctoaaaatca 




qaattttaac 


ccqatcaacc 

\-» Sb- 


W4 W >«p» 


23340 


aranaacttc 


aqttacccqq 




qtaatacqcc 


oqcc acre act 


qqtatcaqcc 


23400 


^^33 y ^3 


t CCTGCaQCTCt 

\w ~j ~n *w w 


^3 %^ ~J ^rffci *^ 


cncaactatc 


tGCQCtOQCO 
*-3 u y UL 33'"3 


aatGGGece t 

^1- ^» w 


23460 




aaatatcaca 


a3 awu 33 u '-3 


aaccaccaaa 


c 3 u 533 L 333 


^- tt 3 3 3 3 3 3 


2 3520 


rarateatat 




ccaaacaacc 


qatoaaaaqc 


atqacaacta 


aatqcacctiG 


23580 


L.( v *~ w ^>3^v-3 ^* 3 


ocaaacatoa 


tatocccocc 


ccqqaacacc 


atccGCtacc 


qtqqaqcttti 

a l -33 a 3 v- w ^ w 


23640 




c add a t ti 1 1 

X«r d W W *k*i ^ W 




cooaccjacca 


actacctaac 


^-33 a 3 u 3 Cw 3 


23700 


^ *4 W *g W W w 


qooataacat 


333 c * w 3 v "^ k -'3 


tqqccactqq 


tqatttacca 


OCCTCCQCCcrC 

3 , *-3 w *"3 W 33 w 


23760 




toqacoaaao 


t ttacaqqaa 


acqcaaoact 


qqctotactq 

33 WW 3 W " W 3 


a c t aci c crac cr 

3 *- w 33 w 3 33 


23820 




aqcatatcac 


qccqqacacq 


G * ,k_ 3 w 33 w 3 w 3 


qqcqqctqac 


cqaqcaQCLq 

*— 3 "3 *-*3 3 w 3 


23880 


aaccaoctat 


qqqcqatqtq 


qcaqccqqqq 

3^*** 3 WV **3333 


C C Q t CfQ t QQQ 

w *"3 **33 w 333 


cqcaqtqqct 

3 3 33 


cccrtqaqti* a 

3"— 3 *"333 


23940 


ccacataca t 

%A V** W ^» 


cgcaacgcag 


j33"" w *~3 




cqqatqaaqc 

w 33 3 3 


aceggtagta 


24000 


aa a c r c c cqq 

^ ^3 


3 3 3 3^ 


tctqccqtqq 




egggtaagae 




24060 


gcaacaccgg 


ggcaaggctg 


gttctggcag 


cagaacagtg 


aagggcggta 


tgtggatgcg 


24120 


ctgcggttgg 


tggatgaaaa 


aaatagaggt 


cagatgctgt 


gaaaaagtac 


gctgtagacg 


24180 


ttcrgtttat 


gtctgcctgt 


gcaggaatgt 


tcctgcccgt 


atttgectgg 


ggcggaacag 


24240 


atgtaaacat 


tgataacccg 


ctggcggagt 


9tgtggatat 


ccatcccgtt 


catcgtcacg 


24300 


aaacggataa 


ccttaccatt 


ctgaaaacaa 


cggccacacc 


gaaaaaaagc 


aceggtgaac 


24360 
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geggctgttt 


ctcaacacta 


accaactaca 


acggacgcgg 


aagtgcatat 


ccccctcagg 


gacgttatcc 


tttcagcttt 


gtattatcgg 


tggcgctaat 


gatacgttgt 


acaccaccgc 


gatggcagta 


agactgactt 


ttgacgggca 


taaggcgacc 


acggggttac 


cggacttaca 


ggcgataccg 


gaagggaatt 


ataaactgtt 


aaatgctgcg 


gattgtgatc 


tgggaccacc 


ggctgccgga 


acatgtgaga 


tacactgggc 


atcagcggac 


gagaagaccc 


gaaaagcctg 


tgattccacc 


aaaggttaca 


gccatggtcg 


tctgaaacag 


gagacggaaa 


aagaaaatgg 


tgtttcnggt 


cagcaaacga 


atggtggaac 


attatcgtgt 


ctggacgtgc 


tgcggttgac 


gagcaccaat 


gataaagtat 


atgactggcg 


aatgcaatgc 


gaatctggta 


agcgtaataa 


tccttattta 


tgtaattcag 


tatttatgtt 


ggctcttgtc 


agcaatatac 


cctgttcttc 


gcattataaa 


gtgcggggca 


ctgtttcctg 


caaaggagtc 


actaccatga 


gttttgtatc 


gacgacaacc 


acgccgccga 


taaccggaga 


cggaagcgtg 


gcagatgcgg 


cggaatccgc 


cgcattgccg 


gagccatcag 


gactggtgaa 


cgccgcaggt 


atggcgcagg 


atgcggtatc 


gggggcgcat 


aatgtgacgg 


tcagcggcag 


cagcctggac 


ggcggcgaaa 


cattaagcga 


gcccgacacc 


ctgaatctgg 


gctatgtctc 


ggtgggcaaa 


gatctgtgcg 


tcaacatcga 


cgggctggtc 


acggcggcgc 


gggcggcggg 


gcgcatggag 


ccgtgggcaa 


aactgctgac 


taaaaccgtg 


gtggatattc 


tggatgaggt 



PCT/SE00/01079 

15 

ccagctcacc ggcgcaggac gccgagggtit 24420 
aggggaatat ctcactggcg aagacgcagg 244 80 
tggataacca gcctgcacgg gatcagaagc 24540 
tgtaatacac agaatagtca gggagaagat 24600 
aaagctgaca tggcctggta ccgggacatt 24660 
gtggccagat aaacagtgtg tgccggangc 24720 
tattcagttt cagggggagg caccgataag 24780 
atggggccgg agtaccattc cgcgaggcca 24S40 
gaaccgggga tataatcgta tccggctgga 24900 
tgggggcaag cggggcggtt tctatatcca 24960 
tattgaagtg gaaccggngc ttttccgtat 25020 
cgaaaagaca tccacggnta atgctaagta 25080 
aaaacaataa taccgttaaa accggangaa 25140 
aatgattaca atggtggcca gacaagcaat 25200 
gtgtgctggt tattaatcaa atttctataa 25260 
aaataaaatc ataatattgt tatgttcatt 25320 
atgtgctaat ttttgtgttt ttattttcat 25380 
tggtaaatat Ctaaatataa caggctggtt 2544 0 
acggtgagtc tatttatttt aatccggtat 25500 
cacaaataat aaatccggta tgggagggct 25560 
aagtggcggt gtcaccgcag atccagtcgc 25620 
cgtzggaacag gctgcgggat cgctatttgg 25680 
agccgcggta gcagcggcgc aggctgccgc 2574 0 
ggccatcgtc tcngctgttg caggcgggcc 25800 
cgccgtaccg ccgggcgcat tactgttcgc 25860 
actgttcagc tatgtggcac agctaaaaac 25920 
cccggcggcc aacctgccgc tcaaaccgat 25980 
actggatggt ggcggtaaac gacatatcag 26040 
ccatgaaggg cgtccggtca cctatgagct 26100 
ccataccagc gactacaaag cattccagaa 26160 
nctggcggaa tatccccacc cggcggaaaa 26220 
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gcggccggtg gaaagctacc cggtacgcac ctggcaggtg cagtacggtg aaactgattt 26280 
tgattttctt cagcgactga tgcaggagtg gggcatctac tggtggtttg agcacagcga 2 634 0 
ggacagccac acgctggcgc tggcggatgc catcagcgcc cacaaagcat gcccggaccc 26400 
gccgctggtc gagtggcacc aggaagggct gaagctggac aaggagtcta tccacactat 26460 
cacggcaaac gagagcccgc ggactggaca gcgggtgctg gatgatttcg attttacgaa 2 6520 
gccacgctca ttgctggcaa acaccgtggc aaacccgcgt gaaaccggtc atgccaccta 26580 
cgagcattat gagtggccgg gagactactt cgacaagagt: gaaggcgaga tgctgacgcg 26640 
cattcgtatg gaagcgcagc gcagccccgg cagtcgggtg ctggggggag ggaatatccg 26700 
cacactcatg accggttaca ccttcacgct ggaaaactat cccaccgccg aagtcaatca 26760 
ggaacatctg ctgatgcaga ccttgctgct tgtgcaggac aacgcgcagc acagcgggca 26820 
ggaccagcac tttaccttrt ccacccgtct tgaaccgcac cccacccgcg aggtgttccg 26880 
cccgcagcgg acggtgagca aaccccacac caaagggccg cagagcgcca tcgtcaccgg 26940 
cccggcgggc caggaaacct ggacggatca gtacgggcgg gtaaaggcac agtctggtrg 27000 
ggatcgctac ggcaaaatgg atgaaaacag caccngctgg atacgcgtca gctacccgtg 27060 
ggcgggcaaa ggcttcggga cgatccagat cccgcgcatc ggccaggaag tgctggtgga 27120 
tttcaaaaac ggcgatccgg atctgccgac catcgtgggg cgtacctaca accaggacac 27180 
catgccgccg tggggactgc cgggaatggc gtcgcagagc gggatcttca gccactcgct 27240 
gtatggcggg ccaacgaacg gcaacatgct gcgttttgac gacaaaacgg gcgcggagga 27300 
agtgaagttc cacgcggaaa aagatctcaa caccacggtg aagaacaarg aaacgcatac 27360 
ggctacggtg gatcgcacta aaaccattat taaaaatgaa accaacagta tcggcgagga 27420 
cagaaacacc acggtaacga agaatgacgg cctttccgta aaactggcgc agacgatcaa 27480 
tatcggcacc acttatcgtt tagatgttgg cgatcaattc acgcttcgct gcggcaatgc 27540 
ggcgcttgtrt ttacataagg acggctccat tgagttttgt ggcaagcaac tgatgttaca 27600 
taccagcgat gtcatgcaac tgattggtaa aggcactgat atgaacccgg atggcggcac 27660 
agccgtaacc gccgatgaca ttgcccccct tctcacctct gagtgatctg aattaaacct 27720 
ggagttctca tggatcgacc ataccgcata caggaagggt gttttgtcct gcctgaaaca 27780 
cttacggatc gcagcgtcaa tatttttatc ctggagggca atgaacgaac atcgcccagc 27840 
ctgaatattt cccgcgatac gctaaaacct gatgaagacc cgcccgccta tatcgaccgc 27900 
cagatcgcac tgatgaaaaa aaacctcggt cagcaccggg tattgtcgcg agcgcctgca 27960 
caggcaggaa cgggcaatga tgcccttatg ggggaacaaa ttgccgccac ccataaaccc 2 8020 
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gggaaaacgg 


aagcgcacca 


gcgtcaggcc 


gggtttattg 


caacccctgg 


caaggtactg 


28080 


gtcttcaccc 


tgaccagtcc 


ccgtcctttt 


gatgataaag 


cagacccacc 


ctggaacacc 


28140 


tggctggcag 


gctttcagcc 


ggataaaaac 


gaataatcac 


acggaggtgt 


gaccatgtat 


28200 


gaagcagccc 


gtgtggatga 


tcctatctac 


cacaccagcg 


cgctcgccgg 


gtttcttatc 


28260 


ggcgctatca 


tcggcatcgc 


cattatcgcg 


ctcgccgcct 


ttgccttctt 


tagctgcggt 


28320 


tttcttgccg 


ggctgattct 


gggttttatg 


gccgatcaaa 


tagcctccgg 


ggtattgcaa 


28380 


ctgggcgagg 


ccatcgggcg 


ctccatccac 


cacacggcag 


gaaaaatcct 


caccggttcg 


28440 


gagaatgtca 


gcaccaacag 


tcgcccggcg 


gcgcgcgcgg 


tactgagtac 


ggtgaaatgc 


28500 


gataaccata 


tcgcagaaaa 


acgcatcgcc 


caagggccgg 


aaaatatcta 


catcaacagc 


28560 


cagcccgccg 


cccgtaagga 


tgaccacacc 


gaatgcgacg 


cggtgattga 


agacggttcg 


28620 


ccgaatgtgt 


ttcrcggcgg 


cggcacacag 


acggtactgg 


aaatcagctc 


rgaaatcccg 


28680 


gactggccgc 


gcaaggtggt 


ggatgtattg 


cttgtcgtgg 


cgagt ctgct 




2874C 


gccggggcgt 


ggcggcaggc 


ggcaaagctg 


gggacgaaat 


ttggcactaa 


acgtgccgcc 


28800 


aagtttatcg 


gcggggagct 


tgtcgggatg 


gccgrgggtg 


aggctatcag 


cgggctgctc 


28860 


agcaatccgg 


tggatgtgac 


caccgggcag 


aaaatcctgc 


tgccggaaac 


ggacttcacc 


28920 


ctgcccggcc 


gcctgccggt 


cacctgctcg 


cgtttttacg 


ccagccacct 


ggaaactgtg 


28980 


ggactgttgg 


gacggggctg 


gcggctgaac 


tgggaaacca 


gcctgcgcga 


tgacgatgaa 


29040 


cacatcacgc 


tgaccggcgt 


acaggggcgg 


gaactgcgtt 


acccgaaaac 


gatgctgacg 


29100 


cccggccacc 


agatatttga 


cccggaagaa 


cagttatacc 


tcagccgcct 


gcatgacggg 


29160 


cgttacgtgc 


tgcattacac 


cgatcgcagc 


cattacgcat 


ttggtgat" 


tgacagtgac 


29220 


ggcacggcat 


acctgctgtt 


tacggagacg 


ccgcaccgcc 


agcgcattgt 


c-tcgggcac 


29280 


gaaggaggca 


gaccggtacg 


gatagcctcc 


agcagcgggc 


atcacctgtt 


actgcaccgc 


29340 


acacagaccc 


cggcagggga 


gcggctgtcg 


cgaattgaac 


tggtgcaggg 


cggcacccgt 


29400 


ggcaacctgg 


tggagtaccg 


gtatgacgat 


aacggtcaac 


tgaccggcgt 


ggtgaaccgg 


29460 


gcgggaacgc 


aggtgcgtca 


gtttgcttat 


gaaaacgggc 


tgatgacggc 


gcacagcaat 


29520 


gcgacggggt 


tcacctgccg 


ctaccgctgg 


caggaactcg 


acggcgcgcc 


gcgcgcgacg 


29580 


gagcacgaca 


ccagtgacgg 


cgaacattac 


cgctccgact 


atgattttgc 


cgcaggcacc 


29640 


accaccgtca 


ccggcaggca 


gggggagaca 


tggcagtggt 


ggtacgacag 


ggaaacgtat 


29700 


atcaccgcgc 


accggacgcc 


gggcggtgga 


atgtaccgct 


tcacgtacaa 


cgaagaccac 


29760 


ttccctgtca 


acattgagct 


gcccggcggt 


cgcacggtgg 


cgtatgaata 


tgacatccag 


29820 


aaccgggtgg 


cgaagacgac 


agatccggaa 


ggccgggtga 


cgcagacgca 


gcggaacggc 


29880 
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gagcccgacg aaatcacgcg cacggcgccg gacgatgacg ctgtctggaa aacgcagtac 2994 0 
aacgcccacg gccagccagt gcaggagacg gacccggaag ggcgggtgac gcagtacgct 3 0000 
tacgatgaac aggggcagat gtgcagccgg acggatgcgg cgggcggcac gcagggcggc 30060 
gtgcggcggg agacgcagca gcgggatgcg cngggccgtc tgttacggac ggagaatgaa 3 0120 
cacggccagc ggacgtccag ccacaaccgg ctggaccaga taacggcagt gacgcncacg 3 0180 
cccacggagg cggggcaaca gcagcaccgg atgcaggccg acacggtgcg tttcgagtat 3 0240 
gaccgcagcg gctggctgac ggcggagcac gcggggaacg gtagcatatg ttatcagcgc 3 0300 
gacgcgctgg gcaacccgac ggacatcacg ctgccggacg ggcagcacct gacgcatctg 30360 
tatracggga gcgggcatct gccacagacg gcgctggacg gcctgacggt gagcgagtat 3 0420 
gagcgcgaca gcctgcaccg tcagataatg cgcacgcagg ggcagcttcc gacgtacagc 30480 
ggcratgacg acgacgggct gctgagccgg cagcgcagtc tggcgtccgg cagcgcccct 3 0540 
grccrtcccg gccagcgccc ggcgcggcag ggccgcgtga cgccgaggga ctatLaccgg 30600 
aacaaccacg gcgaggtggg cacgattgac gacggcctgc gtggcagcgt ggngtacagc 30660 
tacgacagaa gcggttacct gaccgggcgc tcaggtcaga tgtatgacca tgaccgtcat 30720 
tatzacgata aggcgggcaa cctgctggac aacgaagggc agggagcggt gatgagcaac 30780 
cggccgccgg gctgtggtcg tgaccgttac ggctataacg agtggggcga gctgaccacg 30840 
cggcgcgacc agcaactgga gtggaacgcg caggggcagc tgacgcgggt caccagcggc 30900 
aacacggaga cgcactacgg ccacgatgcg ctggggaggc gaacccgcaa ggcgacgcac 30960 
gggcggcaca cgggccatac ggcgcggagc cggacggact ttgtgtggga ggggctcagg 31020 
ccg-cgcagg agaacgtgca gcagcagggc cggcggacct atctgtacga tgcggaacag 31080 
ccgtacacgc cggtggcgag cgtgacgggg cggggagaaa gcaggcaggt gtggtattac 31140 
cacacggatg tgacgggcac gccgcaggag gtgacggcgg cggacggaac gctggtgtgg 312 00 
gcggggtata tcagggggtt tggagagaat gcggcggaca tcagcaacag cggggcgtac 31260 
ttccaccagc cgctgcggct gccggggcag tattttgacg acgagacagg gctgcattac 31320 
aacccgttca gatactatgc accggagtgt ggacggtttg tcagtcagga tccgatcggg 31380 
ctgaggggcg ggttaaacct ttatcagtat gcgccaaatc ctctcaaaca tatagaccca 31440 
ctrggtttaa ccgcgactgt tgggcgatgg atggggcctg cggaatatca gcaaacgctt. 31500 
gatactggga cagtagcaca aagttcaaca gggacaactc atgttgccta ccctgccgac 31560 
atagatgctt ttggtaagca agcaaaaaac ggtgctatgt atgttgaatt tgatgtgcct: 31620 
gaaaaatcat tagtacctac aaatgaagga tgggcaaaaa tagtagggcc agattctatc 31680 
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gaagggcgac cagctaaacg caaaggcttg 
ataactgtaa ggggcgagaa aattaatggg 
aattgtgggt gagcaaacat actgattata 
acagtattat tatagatttt gaagangacc 
acccaagctg tatgtcagaa gtaatggatg 
gaaacgaatt ttctacattt gatgaactcc 
ttaaataata gttggccggg taagaagtta 
ccatcaatcc ggagacgcgc taccggtacg 
cgtacgggcg gcacacgggc catacggcgc 
tcaggctgtt gcaggagaac gtgcagcagc 
aacagccgta cacgccggcg gcgagcgcga 
attaccacac ggacgcgacg ggcacgccgc 
tgtgggcggg gtatatcagg gggttcggag 
cgtaccttca ccagccgctg cggccgccgg 
attacaatct gctcagatat tatgcaccgg 
ttgggctggc ggggggggct gaatctttac 
gatcctttag gacttgctat cctggagcat 
ggatttgaaa atgcgggtat gacaaaccct 
aaaaccggta ccgttgttga gtttaaaggt 
ccccatgcag atatggacgt gacagcaggg 
gcaggaaaaa gaggttccgg aggagctaat 
catccgcatc gctctgactc taagggagat 
ccagaatcga actggagaat gcgattaaaa 
aacccgaaca aatcgtttct gagataaaag 
ggtggttatc attaaaacat agacaatatg 
aaaacatatc acaaatagta agtaaacaac 
tattcttgat cgctgatgaa gataatgagc 
ccctgcctga aattatagaa aattgcagat 
tatcctggcc tatatgtgaa aatgatcatg 
aaagcgcgag tgctctttag cgacatagtt 
ggcgcgattg tctggcaggg gaaacagcaa 
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cctgccccng aaatgccaac agcagaaaac 31740 
gaagctgaag caaaatgcta aacaaattta 31800 
cggtaattca taatgaaaat gatttatcct 31860 
ggtatatatc aagatttact gtacgggatg 31920 
tggatactgg tttatataaa ttaaacaaga 31980 
tggatatatt tgatgatttc atgataagca 32040 
actcttcccg gctgttttat tatctaaccc 32100 
atgcgctggg caggcgggtg agcaaggcga 3 2160 
ggagccggac ggactttgcg tgggaggggt 3222 0 
agggccggcg gacatatccg tacgacgcgg 32280 
cgggaaaggg agaaagcagg caggtgtggt: 3 234 0 
aggaggtgac ggcggcggac ggaacgccgg 3 24 00 
aaaatgcggc ggacatcagc aacagcgggg 32460 
ggcagtiantt tgacgacgag acagggcrgc 32520 
agtgtggacg gttcgtcagt caggaticcga 32580 
cagtatgcgc ctaatccgat tagatggatc 32640 
caatctaatt ctgatgcggc aaggagaacc 32700 
gaggatgtca ctttctcgaa agtcgatccc 32760 
ccaaatgggg ctaaagttgc ttatgatgca 32820 
catgataaac cacatgttgg ttggcaaccc 32880 
agaggtaata ctacttatga tggcccacaa 32 94 0 
gataaatgtt aaattcaaat atgtcrgaac 33000 
atctcggtat tcatgattat cgtgccgata 33060 
agatatatgt taatggtaat cctagaacct 33120 
tcttttctta taccgataat tctggatata 33180 
ccaatgaaag caatgtaacc aacaaacaca 33240 
aaatatatgt atataacgtt cctcttaact 33300 
attttgaata ttatgttgca gatcacgaac 33360 
gtgatttgat tgtatgctca accactaagt 33420 
gcccatattt aggcgttact agccgaagat 33480 
ttccgaggtc aggaagatag cataacccat 33540 
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taaccgggat agatccgcta gacctgaacc cagttgatgc gacaggttat agggtttatg 33600 
gttatttgct cccggagcaa ataaacccta ttacattggt actactaacg atatggtttg 33660 
acgaagggcc gagcattaaa gcactggcag gttatcaaaa gaaaatggaa ggatgctgcc 33720 
atttgatgaa aatgtaatcc aatggaaagn cagaggctac gagaaacatt acatagagaa 3 3780 
atataaaacc agaaccggaa ccacaggtga aaaaattccc tcaacaaata gagagaacaa 33840 
atataattca tttgatcatg ggcgaacaga tcccagcgca caagcattca aagactctta 33900 
aaatagtaag ggagttggtt ccggtggagg aaaatgcgga tgagtgatca agaattttgg 33 960 
ggctgtgata agaagtcgag aacaatgctg cgttttgtga agcccggaga catattttgt 34020 
tttaaattag atgaagatag atattgtttt gggcgaatta taacactaat gactgtcgga 34080 
catctttctg aattatttga tataattaaa aaaccccccg gaataacaga gncagaaatt 34140 
agtaacgcaa ggcgaaccac tgaaccaatt atagtggata catattcttt attcgacaag 34200 
aaattagaaa atggaagtga ctggagaatt attggtcatc aggttaatta caatccaaaa 34260 
aattcagatg gtatctattt tgcacttgga ataggtgatt cctgtaaaaa gaaagactgt 34320 
tacggaaatg attctctcat ttcagaaagt gagtggaaaa cacttcctaa attatctcct 34380 
aaagggggtt ttgatatcaa aaaacggctt gaaattgcct gaaaatgaaa ataaaaagcc 3444 0 
gggaaagatc ctttgtcttc ccggatttta ttatttaatc cccgttcacc acattattta 34500 
cccccgcctt aatatgcttc atcgactttt tcacctgata aagctcctcc cgtagatccc 34560 
ccacttcgtc cgtctctgca atcaggatca aacacccctc ggagatcttc acggtgacgc 34620 
cgtgcccggt ctcaaatccc gcttcttcct gccagtcacc ctnaaggrgc tggctgggga 34 680 
tttgtgagta acaggcggcc atgcaggttt: cgccgttgat atggcggacg crgacgccgg 34740 
aggcattcat atttgctgac taaataaatt cttatttatc cgccggacgc tggctgattg 34800 
tggagctcag ggtgagtgag tatgggcgcg acatcctggc accgcgcccc cctctccccg 34860 
gccagccccg gccggtgatg agcagccggc tgccggggcc gtattttgac gatgaaacgg 34920 
gcctgcatta aaatctgttc agatattatg taccggagtg tggctggttc gtcagccagg 34980 
atccaacagg gctgaaaggg ggatggaacc gatatcattc tccgctgaat cccattacag 35040 
atagtgatcc tcttggcctt attacttgtg gtgctgacag aggtgattct ggcaagttat 35100 
taagatgaaa aatggctgaa aataattgcg ttagcgaatg ctcatttatg cctttaccta 35160 
caaaaagtaa cggatttgct tgctggaatt gtgttaacga atgcaaataa aaccaatgct 35220 
gttgatgttt ttactacagg aataagtatg gttaacgata aggatacagc tatattaatt 35280 
agtaatttaa tgttgagatt cggtaaggag cttgatgaat ctgttgctgt tgttcagccc 35340 



Substitute sheet (Rule 26) 



WO 00/73336 PCT/SE00/01079 

21 



cgctgcgatg 


aggatgaact 


taatgtatat 


cgagaaacgg 


tcggtcttat 


catgggcgaa 


35400 


atgcttatta 


aaataatgaa 


tccattatat 


gaaaaacatc 


cagaaataaa 


accaaaagga 


35460 


ttgaaacaaa 


acatctggaa 


ccggatgaat 


aatgtgtaaa 


agccggaggg 


gttatctttt 


35520 


cccggctttt 


tattatcaat 


tactcattaa 


ctcctgttcc 


gctctcttgc 


gcttaatcac 


35580 


cggaatatct 


ccggtrattgc 


tcagcgcccc 


ggaaacgttc 


tcaaccactg 


ctctgcactc 


35640 


cgtttattaa 


tgcgggttac 


gcccatccct 


tcaatacagc 


caaagagtcc 


gcgggtatgc 


35700 


tgcggcgcga 


tcacgatgca 


acccctcatt 


acccgcacct 


tgacgggcat 


cccgttaata 


35760 


aaccccgccc 


gcggcatcca 


ttcaccgtac 


aaacgaactg 


aaagtctctc 


aacgcgtgag 


35820 


tatgtaagta 


tcccgcataa 


tcgagccatt 


cacatttaga 


gatcatccga 


cacaatcaa: 


35880 


ctgccaacgc 


aggagatcgc 


tatgcgtaaa 


gcccgtatta 


ctgcgcacca 


gatcatcgct 


35940 


gtgatcagac 


cagtcgaacc 


cggacggacc 


gtcaaagatg 


tctaccggga 


gcccggtact 


36000 


tctgaagcca 


ccagggacaa 


ctggaggtct 


ggatacggcg 


gcagggatac 


gcgnggaatc 


36060 


acaaaaggct 


ccaccgtatt 


tactgtctgc 


tcaagccgaa 


ctttcgccgc 


aagggtaaac 


36120 


aacggctgcc 


ggtacgcaat 


ccctcgccac 


tggtcacgcc 


ggaagcgctg 


aaccagagct 


36180 


ggtctgtggg 


cgtcgttttic 


gcacgttcaa 


tgntgttgan 


gactgtaatc 


grgaagcgtt 


36240 


gtcgattgaa 


atcgacctga 


atccgccagc 


tctgcgagtg 


gtccgtgtac 


tcgacaggat 


36300 


tacagcaacc 


gcggttatct 


ggccatgctg 


cgtatggata 


agggaccgga 


acttatctcg 


36360 


ctggcactgg 


ccgaatgggc 


aaagaaacat 


gcagtaaagc 


tggcgtttat 


ccagccgggt 


36420 


aagccgaaga 


aaaacgtttt 


catcacgcgc 


tttaaccgga 


cataccgtac 


agaaatactc 


36480 


aattctcatc 


tgttcagaac 


gccgaatgag 


9~9tgggaaa 


tcacggataa 


agggtcatca 


36540 


gaatataact 


gcgaacgccc 


acatgaatcg 


cggaacaaca 


tgataccgaa 


ggaataccgc 


36600 


caataacgtt 


atctggccgg 


aatcttaaaa 


atgcatggaa 


ctaaaacggg 


tctatttaca 


36660 


ggggcacctg 


cgatgaattt 


cgctgcactg 


aaaagcgata 


ccggatgaga 


gccgctccaa 


36720 


atcaatgcgc 


catgttcacg 


gggaggttgt 


gcgacgttcg 


cataatccag 


caagaactga 


36780 


aaggaagggg 


agagctttrt 


cacgcctgta 


caatcagtct 


ggcctgtgtc 


agtcagctct 


36840 


tagtgttgag 


actctcgttg 


gagcgttata 


actgcttttc 


tgtttcggaa 


aacaagattt 


36900 


cccattaaag 


atcttccctg 


cgaggaaaag 


ttaactaata 


atcttaccgt 


cgagctagga 


36960 


gatgtatgtt 


taaatacaaa 


caatgttgca 


acgatgcctg 


ataattatcc 


tctcttcgaa 


37020 


gataagtttc 


ccacacccag 


tgtagtaggt 


gtcatggtaa 


tgctatcacc 


tgaatgtaaa 


37080 


tggaaggtat 


aattgctttt 


tgactggcat 


tctattccac 


cctgacaaca 


cgatgccaac 


37140 


atcaacactg 


tctatatcgg 


caataacgca 


atttttttca 


gactaagagg 


cgcccrgana 


37200 
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tatagatttt tatgacatta cttatttgaa tcggcaacaa ataccaataa gtacaagctg 37260 

ttatcaccag ccacggattt tttacatacg gtaagatttg gtatggcgtt atgtattctg 37320 

gatgtgctgg attattttaa tttggtttaa aaaaggtggt tattcaa atg aaa age 37376 

Met Lys Ser 
1 

ata aaa aaa ttg att ate gca agt gcg teg age atg atg get get agt 37424 
lie Lys Lys Leu lie lie Ala Ser Ala Leu Ser Met Met Ala Ala Ser 
5 10 15 

tgt tat get ggc tea ttt ttg ccg aac tea gag caa caa aaa tea gtg 37472 
Cys Tyr Ala Gly Ser Phe Leu Pro Asn Ser Glu Gin Gin Lys Ser Val 
20 25 30 ~ 35 

gat att gtg ttt tec tct ccc caa gat tta ace gta teg ctt att cca 37520 
Asp lie Val Phe Ser Ser Pro Gin Asp Leu Thr Val Ser Leu lie Pro 

40 45 50 

gcg teg ggc tta aag get ggg aaa aat get cct age gcg aaa att gcg 37568 
Val Ser Gly Leu Lys Ala Gly Lys Asn Ala Pro Ser Ala Lys lie Ala 
55 60 65 

aag ctt gta gtt aat tct act act ctt aaa gaa ttc ggg gtc agg ggg 37616 
Lys Leu Val Val Asn Ser Thr Thr Leu Lys Glu Phe Gly Val Arg Gly 
70 75 80 

att tct aac aac gtg gta gac agt act ggc act gca tgg cgt gta get 37664 
lie Ser Asn Asn Val Val Asp Ser Thr Gly Thr Ala Trp Arg Val Ala 
85 90 95 

ggt aaa aat act ggt aaa gag ate ggt gtg ggc tta tea agt gac agt 37712 
Gly Lys Asn Thr Gly Lys Glu lie Gly Val Gly Leu Ser Ser Asp Ser 
100 105 110 115 

ctt aga aga tct gat age acg gaa aaa tgg aat ggg gtg aac tgg atg 37760 
Leu Arg Arg Ser Asp Ser Thr Glu Lys Trp Asn Gly Val Asn Trp Met 

120 " 125 130 

ace ttt aat age aat gac aca ctt gat att gtc ctg aca gga ccg gcg 37808 
Thr Phe Asn Ser Asn Asp Thr Leu Asp lie Val Leu Thr Gly Pro Ala 
135 140 145 

cag aat gtc aca get gac acg tac cca ata act tta gac gta gtg gga 37856 
Gin Asn Val Thr Ala Asp Thr Tyr Pro lie Thr Leu Asp Val Val Gly 
150 155 160 

tat caa cct taa tagtaaacaa ctattagtgt attgtgcctt gtttaaggcg 37908 
Tyr Gin Pro 
165 

caatacacat caaatcatct atttttcttt tacaattttt gat atg aaa ata gtt 37963 

Met Lys lie Val 
170 

aat ttt get gta atg gcg gta get ttg ttc gec act aat tct atg gtt 38011 
Asn Phe Ala Val Met Ala Val Ala Leu Phe Ala Thr Asn Ser Met Val 
175 180 18S 
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tea gta tat gec gtc aac cag caa tta aat tea gec act aaa tta ttc 38059 
Ser Val Tyr Ala Val Asn Gin Gin Leu Asn Ser Ala Thr Lys Leu Phe 
190 195 200 



age gtg aag ctg ggg get aca cga gtg att tat cac get ggt acg get 
Ser Val Lys Leu Gly Ala Thr Arg Val lie Tyr His Ala Gly Thr Ala 
205 210 215 

gga gcg acg etc teg gtg age aac ccg cag aat tac cct att ttg gtt 
Gly Ala Thr Leu Ser Val Ser Asn Pro Gin Asn Tyr Pro lie Leu Val 
220 225 230 235 

cag tct tea gtc aaa gca gca gac aaa agt teg cct get ccc ttt ttg 
Gin Ser Ser Val Lys Ala Ala Asp Lys Ser Ser Pro Ala Pro Phe Leu 

240 245 250 

gtg atg ccg cct eta ttt cgt tta gaa gca aac cag cag agt caa ctg 
Val Met Pro Pro Leu Phe Arg Leu Glu Ala Asn Gin Gin Ser Gin Leu 
255 260 265 

cgt att gtc cgt act ggt ggt gac atg cca acg gat cgt gag act tta 
Arg He Val Arg Thr Gly Gly Asp Met Pro Thr Asp Arg Glu Thr Leu 
270 275 280 



aca cag get aag ggc gcg ace ctt gac etc aat ttg tec ate aac gec 
Thr Gin Ala Lys Gly Ala Thr Leu Asp Leu Asn Leu Ser He Asn Ala 
300 305 310 315 



gtt att aca gac ttt ggt ggt gaa agt cat ccg ttc cac tac gtt ctt 
Val He Thr Asp Phe Gly Gly Glu Ser His Pro Phe His Tyr Val Leu 

400 405 410 



38107 



38155 



38203 



38251 



38299 



cag tgg gtc tgt ata aag gcg gta cca ccc gaa aat gaa ccg teg gat 38347 
Gin Trp Val Cys He Lys Ala Val Pro Pro Glu Asn Glu Pro Ser Asp 
285 290 295 



38395 



tgt gat aag ctg att ttc cgc ccg gat gec gtg aag ggg acg ccg gaa 3 8443 

Cys Asp Lys Leu He Phe Arg Pro Asp Ala Val Lys Gly Thr Pro Glu 

320 325 330 

gat gtt gca gga aat tta aga tgg gtg gag acg ggc aac aaa ctt aag 38491 

Asp Val Ala Gly Asn Leu Arg Trp Val Glu Thr Gly Asn Lys Leu Lys 
335 340 345 

gtg gag aac ccc ace ccg ttt tac atg aat tta gee tct gtc aca gta 

Val Glu Asn Pro Thr Pro Phe Tyr Met Asn Leu Ala Ser Val Thr Val 
350 355 360 

ggg 99 a aa 9 ccc att aca 999 9 a 9 tat 9 tc ccc ccc ttc 9 ct 9 ac 

Gly Gly Lys Pro He Thr Gly Leu Glu Tyr Val Pro Pro Phe Ala Asp 
365 370 375 

aaa aca eta aat atg cca ggt agt gee cat ggt gat ate gag tgg aga 38635 

Lys Thr Leu Asn Met Pro Gly Ser Ala His Gly Asp He Glu Trp Arg 

380 385 390 395 



38539 



38587 



38683 



aaa taa atccaggggc ttagcggcag aaa atg aag ttc aaa caa cct gec ttg 38736 
Lys Met Lys Phe Lys Gin Pro Ala Leu 

415 420 



:ta ctg ttc ate gcg gga gtg gtt cat tgc gca aat gcg cac act tac 



38784 
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Leu Leu Phe He Ala Gly Val Val His Cys Ala Asn Ala His Thr Tyr 
425 430 435 

aca etc gat gca tea atg ttg ggc gat gca gcg aaa ggg gtt gat atg 38832 
Thr Phe Asp Ala Ser Met Leu Gly Asp Ala Ala Lys Gly Val Asp Met 
440 445 450 

teg etc ttt aac cag ggg tea caa cag cca ggg act tat cgc gtg gac 3 8880 
Ser Leu Phe Asn Gin Gly Leu Gin Gin Pro Gly Thr Tyr Arg Val Asp 
455 460 465 

gtg atg gtg aat ggt aaa cgt gtc gac acc cgt gat gtg gtg ttc aaa 38928 
Val Met Val Asn Gly Lys Arg Val Asp Thr Arg Asp Val Val Phe Lys 
470 475 480 485 

ttg gaa aag gat ggg caa gga acg cct gtt ctg get cct tgt ttg acg 38976 
Leu Glu Lys Asp Gly Gin Gly Thr Pro Val Leu Ala Pro Cys Leu Thr 

490 495 500 

gtc agt cag ctt tea cgc tac ggc gta aaa acg gaa gat tac cct cag 39024 
Val Ser Gin Leu Ser Arg Tyr Gly Val Lys Thr Glu Asp Tyr Pro Gin 
505 510 " 515 

ttg tgg aaa gca gca aag ccc cca gat gag tgt gcg gat ctg acc gec 39072 
Leu Trp Lys Ala Ala Lys Pro Pro Asp Glu Cys Ala Asp Leu Thr Ala 
520 525 530 

att cca cag get aaa gcg gta ctg gat ate aat aat cag caa ctg caa 3 9120 
He Pro Gin Ala Lys Ala Val Leu Asp lie Asn Asn Gin Gin Leu Gin 
535 540 545 

ctg agt att ccg cag ttg gcg ttg cgt ccg gaa ttt aag ggg ate get 3 9168 
Leu Ser He Pro Gin Leu Ala Leu Arg Pro Glu Phe Lys Gly He Ala 
550 555 560 565 

cca gaa gat ctt tgg gat gat ggt att ccg gcg ttt ctg atg aac tac 39216 
Pro Glu Asp Leu Trp Asp Asp Gly He Pro Ala Phe Leu Met Asn Tyr 

570 575 580 

agt gcg agg aca acg cag acg gat tac aaa atg gat atg gtg ggg cgt 3 9264 
Ser Ala Arg Thr Thr Gin Thr Asp Tyr Lys Met Asp Met Val Gly Arg 
585 590 595 

gac aac tct tec tgg gta caa ctg caa ccg gga ate aat ata ggt gcg 39312 
Asp Asn Ser Ser Trp Val Gin Leu Gin Pro Gly He Asn lie Gly Ala 
600 605 610 

tgg cgt gtc cgc aat gcg acc age tgg cag egg agt agt caa ctg teg 39360 
Trp Arg Val Arg Asn Ala Thr Ser Trp Gin Arg Ser Ser Gin Leu Ser 
615 620 625 

ggg aag tgg cag gca gca tat acc tat get gag cgt gga ctg tac tea 3 9408 
Gly Lys Trp Gin Ala Ala Tyr Thr Tyr Ala Glu Arg Gly Leu Tyr Ser 
630 635 640 " 645 

eta aaa agt cgt ctg act ctg ggg caa aag act teg cag ggg gag ata 3 9456 
Leu Lys Ser Arg Leu Thr Leu Gly Gin Lys Thr Ser Gin Gly Glu He 

650 655 660 

ttt gat agt gtg cca ttt acc ggt gtg atg ttg gca teg gat gac aac 39504 
Phe Asp Ser Val Pro Phe Thr Gly Val Met Leu Ala Ser Asp Asp Asn 
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665 670 675 

ang gtg ccc tac agt gag egg cag ttt get ccg gta gtg cgt ggg att 3 9552 
Met Val Pro Tyr Ser Glu Arg Gin Phe Ala Pro Val Val Arg Gly lie 
680 685 690 

gec cgc acg cag get egg gtg gag gtc aaa cag aat ggt tac ace att 39600 
Ala Arg Thr Gin Ala Arg Val Glu Val Lys Gin Asn Gly Tyr Thr lie 
695 700 70S 

tac aac acc act gtg gcg ccc gga ccg ttt gca ctg egg gat ctg teg 3 9648 
Tyr Asn Thr Thr Val Ala Pro Gly Pro Phe Ala Leu Arg Asp Leu Ser 
710 715 720 725 

gta aca gac agt agt ggt gat ctg cat gtc acc gtg tgg gag gec gat 3 96 96 
Val Thr Asp Ser Ser Gly Asp Leu His Val Thr Val Trp Glu Ala Asp 

730 735 740 

ggc agt aca caa atg ttt gtg gtg ccg tat cag acc ccg gcg ata gca 39744 
Gly Ser Thr Gin Met Phe Val Val Pro Tyr Gin Thr Pro Ala lie Ala 
745 750 755 

ctg cac cag gga tat ttg aag tac age ctg ttg gcg ggc cga tac cga 3 9792 
Leu His Gin Gly Tyr Leu Lys Tyr Ser Leu Leu Ala Gly Arg Tyr Arg 
760 765 770 

teg tea gac tct gca acg gat aag egg cag ate gcg cag get acg ttg 3 9840 
Ser Ser Asp Ser Ala Thr Asp Lys Arg Gin lie Ala Gin Ala Thr Leu 
775 780 785 

atg tat ggt ctg ccg tgg aat etc act gca tac ggc ggt ata cag agt 39888 
Met Tyr Gly Leu Pro Trp Asn Leu Thr Ala Tyr Gly Gly lie Gin Ser 
790 795 800 805 

gca acg cat aat caa get gca ttg ctt ggt ttg ggg gga tct etc ggg 39936 
Ala Thr His Asn Gin Ala Ala Leu Leu Gly Leu Gly Gly Ser Leu Gly 

810 815 820 

egg tgg ggg agt tta tct gtc gat gga age gac aca cac agt cag cgt 39984 
Arg Trp Gly Ser Leu Ser Val Asp Gly Ser Asp Thr His Ser Gin Arg 
825 830 635 

cag ggg gag gcg gta cag caa gga gee tec tgg cga ctg cgt tac age 4 003 2 
Gin Gly Glu Ala Val Gin Gin Gly Ala Ser Trp Arg Leu Arg Tyr Ser 
840 845 850 

aac cag ctg act gcg acg ggg aca aat ttt ttt ctg acg aga tgg cag 40080 
Asn Gin Leu Thr Ala Thr Gly Thr Asn Phe Phe Leu Thr Arg Trp Gin 
855 860 865 

tat gee teg cag ggc tat aac acc eta tec gat gtg etc gac agt tat 40128 
Tyr Ala Ser Gin Gly Tyr Asn Thr Leu Ser Asp Val Leu Asp Ser Tyr 
870 875 880 885 

cga cat aat ggc aac cgt eta tgg teg tgg egg gaa aat ttg cag ccg 40176 
Arg His Asn Gly Asn Arg Leu Trp Ser Trp Arg Glu Asn Leu Gin Pro 

890 895 900 



age teg cgt act acc ctg atg ttg agt cag tea tgg ggg agg cat ttg 
Ser Ser Arg Thr Thr Leu Met Leu Ser Gin Ser Trp Gly Arg His Leu 
905 910 915 
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ggc aat ctg agt tea acc ggt tec cgt acc gac tgg cgt aat cgc ccc 40272 

Gly Asn Leu Ser Leu Thr Gly Ser Arg Thr Asp Trp Arg Asn Arg Pro 
920 925 * 930 

ggt cat gat gac age tac gga ctg agt tgg gga acc tct ate gga ggg 4 0 320 

Gly His Asp Asp Ser Tyr Gly Leu Ser Trp Gly Thr Ser He Gly Gly 
935 940 945 

ggc teg ctg tea ttg aac tgg aat caa aac aga acg ctg tgg cgc aat 40368 

Gly Ser Leu Ser Leu Asn Trp Asn Gin Asn Arg Thr Leu Trp Arg Asn 

950 955 960 96S 

ggc gcg cac cgt aaa gag aac ata acc age ctg tgg ttc agt atg cca 4 0416 

Gly Ala His Arg Lys Glu Asn He Thr Ser Leu Trp Phe Ser Met Pro 

970 975 980 

tta age cgc tgg acg ggg aat aat gta agt get agt tgg cag atg act 40464 

Leu Ser Arg Trp Thr Gly Asn Asn Val Ser Ala Ser Trp Gin Met Thr 
985 990 995 

tea cca tea cac ggt get cag acg caa caa gtg ggg gtc aac gga gag 40512 

Ser Pro Ser His Gly Gly Gin Thr Gin Gin Val Gly Val Asn Gly Glu 
1000 1005 1010 

gca ttc agt cag caa ctg gat tgg gag gtg cgt cag agt tac cgt gec 40560 

Ala Phe Ser Gin Gin Leu Asp Trp Glu Val Arg Gin Ser Tyr Arg Ala 
1015 1020 1025 

gat gec ccg cca ggt ggt ggt aat aac age gca ttg cac ttg gca tgg 40608 

Asp Ala Pro Pro Gly Gly Gly Asn Asn Ser Ala Leu His Leu Ala Trp 
1030 1035 1040 1045 

aat ggg gat tac ggc ctg tta ggt ggt gac tat age tac age egg gcg 40656 

Asn Gly Asp Tyr Gly Leu Leu Gly Gly Asp Tyr Ser Tyr Ser Arg Ala 

1050 1055 1060 

atg cgc cag atg gga gtc aat ate gcg gga ggt ata gtt ate cac cat 40704 

Met Arg Gin Met Gly Val Asn He Ala Gly Gly He Val lie His His 
1065 1070 1075 

cat ggt gtg acg ctg ggg caa cct ttg caa ggc tea gtg gcg ctg gtt 4 0752 

His Gly Val Thr Leu Gly Gin Pro Leu Gin Gly Ser Val Ala Leu Val 
1080 1085 1090 



gaa gcg cca ggg gec teg ggg gtg cca gtt ggc ggc tgg cct ggc gtt 
Glu Ala Pro Gly Ala Ser Gly Val Pro Val Gly Gly Trp Pro Gly Val 
1095 1100 1105 



cag gag aat aca gtc age etc gat ccg teg cga eta ccg gat gac gca 
Gin Glu Asn Thr Val Ser Leu Asp Pro Ser Arg Leu Pro Asp Asp Ala 

1130 " 1135 1140 



40800 



aag acg gat ttt cgt ggc gac acc aca gtg ggc aac ctg aac gtc tat 4 084 8 
Lys Thr Asp Phe Arg Gly Asp Thr Thr Val Gly Asn Leu Asn Val Tyr 
1110 1115 * 1120 1125 



40896 



gag gtc aca caa acc gat gtg cgc gtg gtg cca acc gaa ggg gcg gtg 40944 
Glu Val Thr Gin Thr Asp Val Arg Val Val Pro Thr Glu Gly Ala Val 
1145 1150 1155 
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gcg gaa gcg aag ttt cac act cgc acc ggg gcc agg gca ctg atg acg 40992 
Val Glu Ala Lys Phe His Thr Arg lie Gly Ala Arg Ala Leu Met Thr 
1160 1165 1170 

ctg aaa egg gaa gat ggt age gcc att cct ttc ggg gcg cag gtt aca 41040 
Leu Lys Arg Glu Asp Gly Ser Ala lie Pro Phe Gly Ala Gin Val Thr 
1175 1100 1185 

gtc aat ggg cag gat ggc agt get get ctg gtg gat act gat age cag 41088 
Val Asn Gly Gin Asp Gly Ser Ala Ala Leu Val Asp Thr Asp Ser Gin 
H90 11^5 1200 1205 

gtt tat etc act ggt ttg gcg gat aag ggc gaa ctg acg gtg aaa tgg 4113 6 
Val Tyr Leu Thr Gly Leu Ala Asp Lys Gly Glu Leu Thr Val Lys Trp 

1210 1215 1220 

gga gca cag caa tgt egg gtt aac tac cgc eta cct gcc cac aag gga 41184 
Gly Ala Gin Gin Cys Arg Val Asn Tyr Arg Leu Pro Ala His Lys Gly 
1225 1230 1235 

ate gcg ggc ttg tat caa atg age ggt etc tgc aga tag ccgattctga 41233 
lie Ala Gly Leu Tyr Gin Met Ser Gly Leu Cys Arg 

1240 * 1245 1250 

aggagagaat a atg tgg atg aaa ata cag cga gtg aaa acg gtt ate tat 41283 
Met Trp Met Lys lie Gin Arg Val Lys Thr Val lie Tyr 

1255 1260 



age gta age tta ctg gtc get gcc agt age ttg gtg ccg ata gcg aac 

Ser Val Ser Leu Leu Val Ala Ala Ser Ser Leu Val Pro lie Ala Asn 

1265 1270 1275 

gcc gca gaa aaa ctt cag aca acg eta cgt gta ggt act tac ttt egg 

Ala Ala Glu Lys Leu Gin Thr Thr Leu Arg Val Gly Thr Tyr Phe Arg 

1280 1285 1290 1295 

get ggg cac gtg cca gat ggg atg gtg ctt gcg caa ggc tgg gtg act 

Ala Glv His Val Pro Asp Gly Met Val Leu Ala Gin Gly Trp Val Thr 

1300 1305 1310 



gcg ggt aac acg cct acc gta ttg ctg ctg age ggg caa cag gat cct 
Ala Gly Asn Thr Pro Thr Val Leu Leu Leu Ser Gly Gin Gin Asp Pro 
1330 1335 1340 



agt ttc agt gtg gtc gtt gat ggc aat cag gaa gtg cct gcg gac acc 
Ser Phe Ser Val Val Val Asp Gly Asn Gin Glu Val Pro Ala Asp Thr 

1380 1385 1390 

tgg acg ctg gat ttt aag gcc tgt gca ttg gcg cag gag gat acg tag 



41331 



41379 



41427 



tat cac ggc agt cac age ggg ttt egg gta tgg age gat gag caa aag 41475 
Tyr His Gly Ser His Ser Gly Phe Arg Val Trp Ser Asp Glu Gin Lys 
1315 1320 1325 



41523 



cgc cat cac att cag gtt cgc ctg gag ggc gag ggg tgg caa cca gat 41571 

Arg His His lie Gin Val Arg Leu Glu Gly Glu Gly Tr? Gin Pro Asp 
1345 1350 1355 

acg gtg agt ggt cgt ggc gcc att tta aga acc get gca gat aac gcc 41619 

Thr Val Ser Gly Arg Gly Ala lie Leu Arg Thr Ala Ala Asp Asn Ala 
1360 1365 1370 1375 



41667 



41715 
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Trp Thr Leu Asp Phe Lys Ala Cys Ala Leu Ala Gin Glu Asp Thr 
1395 1400 1405 

ccgtctgttc cactcatact tcctgtatca gtgaataagg cgaatactgt atatcaggtt 41775 

cgaggatgta ttgaaaattt actcaaggca aatatggtta gcaatcttca ttgcagaaaa 41835 

atcaattgct gattgatttt aaaatctgaa atccatcttc tttgtaggga gaggaatatt 41895 

atgtttcgta aaataattta acggacactg atgcctttat tatttctgtt ttgtggattt 41955 

gtttgattca ctcttttgtg agcggggctt tataagcccc tggtataagg tttgtttatc 42015 

tgtcaggtat tcatatgtga tatttaagat tttatgttta gggggctttt tcttgtcccc 42075 

tcaggcgtaa aaataattta ttttttacat aaaggaataa agcatatgtc ttatgcacga 4213 5 

catttaccgg tattaatgta tcaccatgtc agtgataaac ccggacagat aaccttatct 42195 

ccccgtacgt tccgggcgca gatgaaatgg ctggccgaat ctggctggaa aaccgttact 4225 5 

gcctgcagag gtggaagctt tttatcatgg tgcaagattg cctcgtaaaa gcgtcatgct 42315 

gacctttgat ggcggctggc tggataactg gttgcaggtt tttccggtgc tgcaggagtt 42375 

taatctgcat gcgcatctct ttcttgtgac cagtttgatc agtgacggac cggtccgtat 42435 

tcctgcaggc gaaccggtgt actctcatga tgagtgtcaa atgctggtta aacaaggccg 42495 

ggctgatgag gtcatgctgc gctggtcaga ggtccgggag atgcacctca gtggccttgt 42555 

tgagtttcac tcgcacacgc acacccaccg acgctgggac cagaagcctg tgtcccgtaa 42615 

tccgtcggat ttgcttcgtg tcgatattct tcttagtcgt aagcggatga gggagatgct 42675 

gggttattgc agtcagcatc tgtgctggcc tgagggctgg tattgttctg actatattca 42735 

tgtggctgaa gagttggggt tcacatacct gtataccaca gaaaggcgta tgaacaatcc 42795 

agtcatcggt tcacagcgta ttggtcgtat caacgcaaag gagcgaaaga atgtgggctg 42 855 

gctgaaacgt cgtctgtttt atcacaccac gcccggattt tcttcgctgc tggcccggca 42915 

taagggggca cgtcggatag ctgactgagc cagagaccag gatgaaagtg ctgcataccg 42 975 

aatcctcccc catcatcggc gggcaggggt tgtaggctat atcccaaatg atggtgctga 43035 

ttttgagcag aaataagata acaccggagg ccgtgataga cccggttgcg cagaatctgg 43 095 

aggcagcagg tcactggcgc agggcttcta cccgctggct tttggttatg ggagattttg 43155 

aatgtacaga ggcccagcgg gagtggttgt tgttgcgccg gaattattgc cttgcgcaga 43215 

tatcttcccc tggcgccagt aaagctggat atcagcgacg tgacgaaatg agaggagcct 43275 

gcacataggt ttgtgttttg agtaagtgtc ttatttaaac cgtctgttct gtttcctccg 43335 

ctctcacaaa taatgtcgag ccgggtgggg gactcaagta agaataatct ggcgatgttt 43 395 

tgcttgtttc cacgggatac tttgttaggt gaacgataca attaatgcgc tccagactgc 43455 
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gccacagaaa aggagaatgc ctttgcagaa cgggcgcatg acaatgaacc actttattat 43 515 
tggtatctgg ttcggcgaca accttgacga tcacgtctga gatagggtat ttacactgag 43575 
tggtaaacag gttcaattag taaccggaga tggatgcaaa atcatgatcg attcagatgc 43635 
tattttgcag ccaatagatt ttttattaag atgatatcaa ggacattgag gcacacaacg 43695 
acgcatcagt aagtcgtcga tagctcattt gatataagaa ttccttttat caacggaaga 43755 
taacgatgga actgatcaat aatcgtggta tgcgagactg gatgatattt attaaagtgg 43815 
cggaagtagg gaatctttcc cgggctgcgc gggaattaga tattagcatt tctgctgtca 43875 
gtaaatcgct tagtcgcctt gagaattcta ttgaggtcac tttacttcgg cgcgattzcac 43935 
atcacctaga actgactgga gctggtcaga cagcctatgc aagcatgaaa aggataacat 43995 
cttcctttca gtccttgctg gatgaattgc gaaatccgga taaaattatc agagggagca 44055 
taaaattctc ggctccggct atcgtccgtg agtttcttgc caataagtgg atatgggaat 44115 
ttacagctag ctatccggat acaaaaatct acctggactc acgagagcgt agcgactcrc 44175 
ttagtaaatc cctggagttt gatgagctgg tttttaaaag tggcataatc gaaagtgagg 4423 5 
atctcgtgta tcgaaagata agccctttaa agntggtrct ttgtgcgagt ccgaaatata 44295 
tcagaaaata tggcaggatc tcacaccctg gcgatttgga aaatcacatt attgtgggcc 44355 
ttcacaacca tggtctttcc ggacctctta ctcttttccg tcaggatgaa tcatacacta 44415 
ttagtggcgc tgttaatgtt catttatctt ccaataatct tttgagtgtt cttaatttgg 44475 
ttttagaagg aaagggtatc aacctcatga ctccggcctg gcttgccacc aaatacttaa 44535 
aaaataatga acttgaaatt atacttcctg aatggagggc tccagatctc cccatttatc 44595 
ttgcatggcg tcatcgccag tattattctc ctttatttca acgctttctg tcttttatcg 44655 
aagataaatg gaataatcgc ccacaaattg attttctgaa tgatgattaa cccgtttgga 44715 
atggttttga tacgttcctg acttaaaccc acatgatgac tgaattgagg catcgagaca 44775 
tgcgactggt cagccagtcg tcttttgacg atgcccaata caaaacatga tgccgactga 4483 5 
cggaaatgat aatacgcgga aacaggacgg ggctgttttt gggcagccgg aagttaagcc 448 95 
cataccagaa acgttgcagt gtactgaaaa acggcgccag gttgcacctg ttcaaagatic 44 955 
ttccgaaggc gcaggagtat tcattactga tatcctccat tgcgccttcg ggaacccaca 45015 
ggaccagcta ttttaccgat agtgtttaaa aggcgtaagt aatgccgagc atgaagtcat 45075 
tggaggcagc ctttgtgtct gcatcataag cggtatgttc atcaccagca tagcgatcct 45135 
ttgaaatgct tactttgcca gcattaatgt atttataact ggcgtcaatc ataatattat 45195 
ctgtcacagc atattttgca ccgatacctg cgccccaggc aaagttattt tttgaagcag 45255 
acagagtttc attaatacca aaaccaacag gaatggcgtt attacttagc ttcacangag 45315 
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cgaggccaac gcctgcgctg atatagggag taaatgccgt actattgtga aaatcataat 4 5375 
agccattaac catgtaagtg gtcattcgga cctgattttt tacatttatg tgtactggat 45435 
caccaaatgc aataatatcc tgcccgcctt tagcatccgt ctcacctctg aaagtggtat 4 54 95 
ccagttctaa acgtactgga agctggaatg gatcataaaa gtcataaccg atagcaaccc 4 5555 
cgccgccaaa aacgcctttg gtacggtcag gtaacgttgc atgaccatta actatctcat 45615 
cctggctgaa ggttgagttg attccataga cattgactac ggatgtcccc gctttcccgg 4 5675 
tgatatagat cccttctttt gctgatgcag tagcggacca ggctaccaca aggggaatga 4573 5 
tgcagactgc gaaaaagttt ttcatttcag aacctgcctt aatattgggc taaaagacaa 45795 
gtttcacggt atagggtgtg atataacgat tacataaacg aagcccaaaa aacggtctat 45855 
tgtaacgctg ggttttctgt aagcgggtaa aaaatgagat gaagatttta aataacaata 45915 
cgataatcgt cggtatggaa atccatctcc tcgccaaact gccccacgta cggtttcact 45975 
tctacgttat gtaacgggta gtgcgagatg gagcgatgct gtaagaaaaa gatgaagatg 4 603 5 
aatttgtacc cgacctggat aaagcccgtt atcccggaat aacgggcaaa aatatttact 46095 
caagtgcctg ggcgagatct tgttgtacct gttgacgctg ttctggtgtt aagactttgc 46155 
ttaaatcaaa ataatattta acccgacaat agcgagcctg ttgttctatg ttactgaagg 46215 
ctgcaagctg ctgttttacg gcggcgtcat cccatttacc ggatttaatc acctctatca 46275 
gcgcaccgtc tttaattccc ttcatagaaa tctgactgac gtcggtttcc agttgttggt 46335 
gaagtttttt gatccgggta atctgatcgt ttgtcagctt cagatgctgg acaataggat 463 95 
cctgggcggg caggggagga tcggggacag cggtggcgaa agcgccaaaa gaaacgcccg 4 6455 
ccagagtcgc tgccagtaaa gttgcgcgta caaagttttt catgaagata tcctgataag 46515 
ggagtgatta accgttttta ttacccacga atggcgagca attatcttag agcctatccc 46575 
agtagggcta ttttacttgc cattttggac ctgggcagtg ctcgccaaaa cgcgttagcg 46635 
ttttgaacgc cgctagcggc ggcccgaagg gcgagcgtag cgagtcaaac ctcacgtact 46695 
acgtgtacgc tccggttttt gcgcgctgtc cgtgtccaaa ctggctgcgc caataacgcc 46755 
tggtgggata ggctcttagt cagaatacgt tgcctgccac attacgccac gcgaatttgt 46815 
tttacggaga gttacggagt gaaacaatcc cgccgcggtg agcggcaggt tgctt 46870 

<210> 2 
<211> 166 
<212> PRT 

<213> Salmonella typhimurium 
<400> 2 

Met Lys Ser lie Lys Lys Leu lie lie Ala Ser Ala Leu Ser Met Met 
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.1 5 10 15 

Ala Ala Ser Cys Tyr Ala Gly Ser Phe Leu Pro Asn Ser Glu Gin Gin 
20 25 30 

Lys Ser Val Asp lie Val Phe Ser Ser Pro Gin Asp Leu Thr Val Ser 
35 40 45 

Leu lie Pro Val Ser Gly Leu Lys Ala Gly Lys Asn Ala Pro Ser Ala 
50 55 60 

Lys lie Ala Lys Leu Val Val Asn Ser Thr Thr Leu Lys Glu Phe Gly 
65 70 75 80 

Val Arg Gly lie Ser Asn Asn Val Val Asp Ser Thr Gly Thr Ala Trp 

85 90 95 

Arg Val Ala Gly Lys Asn Thr Gly Lys Glu lie Gly Val Gly Leu Ser 
100 105 110 

Ser Asp Ser Leu Arg Arg Ser Asp Ser Thr Glu Lys Trp Asn Gly Val 
115 120 125 

Asn Trp Met Thr Phe Asn Ser Asn Asp Thr Leu Asp lie Val Leu Thr 
130 135 140 

Gly Pro Ala Gin Asn Val Thr Ala Asp Thr Tyr Pro lie Thr Leu Asp 
145 150 155 160 

Val Val Gly Tyr Gin Pro 

165 



<210> 3 
<211> 245 
<212> PRT 

<213> Salmonella typhimurium 
<400> 3 

Met Lys lie Val Asn Phe Ala Val Met Ala Val Ala Leu Phe Ala Thr 
15 10 15 

Asn Ser Met Val Ser Val Tyr Ala Val Asn Gin Gin Leu Asn Ser Ala 
20 25 30 

Thr Lys Leu Phe Ser Val Lys Leu Gly Ala Thr Arg Val lie Tyr His 
35 40 45 

Ala Gly Thr Ala Gly Ala Thr Leu Ser Val Ser Asn Pro Gin Asn Tyr 
50 55 60 

Pro lie Leu Val Gin Ser Ser Val Lys Ala Ala Asp Lys Ser Ser Pro 
65 70 75 80 

Ala Pro Phe Leu Val Met Pro Pro Leu Phe Arg Leu Glu Ala Asn Gin 

85 90 95 

Gin Ser Gin Leu Arg lie Val Arg Thr Gly Gly Asp Met Pro Thr Asp 
100 105 110 

Arg Glu Thr Leu Gin Trp Val Cys lie Lys Ala Val Pro Pro Glu Asn 
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Glu Pro Ser Asp 
130 

Ser lie Asn Ala 
145 

Gly Thr Pro Glu 



Asn Lys Leu Lys 
180 

Ser Val Thr Val 
195 

Pro Phe Ala Asp 
210 

lie Glu Trp Arg 
225 

His Tyr Val Leu 



120 

Thr Gin Ala Lys 
135 

Cys Asp Lys Leu 
150 

Asp Val Ala Gly 
165 

Val Glu Asn Pro 



Gly Gly Lys Pro 
200 

Lys Thr Leu Asn 
215 

Val lie Thr Asp 
230 

Lys 
245 



32 



Gly Ala Thr Leu 
140 

lie Phe Arg Pro 
155 

Asn Leu Arg Trp 
170 

Thr Pro Phe Tyr 
185 

lie Thr Gly Leu 



Met Pro Gly Ser 
220 

Phe Gly Gly Glu 
235 



125 

Asp Leu Asn Leu 



Asp Ala Val Lys 
160 

Val Glu Thr Gly 
175 

Met Asn Leu Ala 
190 

Glu Tyr Val Pro 
205 

Ala His Gly Asp 



Ser His Pro Phe 
240 



<210> 4 
<211> 836 
<212> PRT 
<213> Salmonell 

<400> 4 

Met Lys Phe Lys 
1 

His Cys Ala Asn 
20 

Asp Ala Ala Lys 
35 

Gin Pro Gly Thr 
50 

Asp Thr Arg Asp 
65 

Pro Val Leu Ala 



Val Lys Thr Glu 
100 

Asp Glu Cys Ala 
115 

Asp He Asn Asn 
130 

Arg Pro Glu Phe 



typhi murium 



Gin Pro Ala Leu 
5 

Ala His Thr Tyr 



Gly Val Asp Met 
40 

Tyr Arg Val Asp 
55 

Val Val Phe Lys 
70 

Pro Cys Leu Thr 
85 

Asp Tyr Pro Gin 



Asp Leu Thr Ala 
120 

Gin Gin Leu Gin 
135 

Lys Gly He Ala 



Leu Leu Phe He 
10 

Thr Phe Asp Ala 
25 

Ser Leu Phe Asn 



Val Met Val Asn 
60 

Leu Glu Lys Asp 
75 

Val Ser Gin Leu 
90 

Leu Trp Lys Ala 
105 

He Pro Gin Ala 



Leu Ser He Pro 
140 

Pro Glu Asp Leu 



Ala Gly Val Val 
15 

Ser Met Leu Gly 
30 

Gin Gly Leu Gin 
45 

Gly Lys Arg Val 



Gly Gin Gly Thr 
80 

Ser Arg Tyr Gly 
95 

Ala Lys Pro Pro 
110 

Lys Ala Val Leu 
125 

Gin Leu Ala Leu 



Trp Asp Asp Gly 
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145 150 155 160 

He Pro Ala Phe Leu Met Asn Tyr Ser Ala Arg Thr Thr Gin Thr Asp 

165 170 175 

Tyr Lys Met Asp Met Val Gly Arg Asp Asn Ser Ser Trp Val Gin Leu 
180 185 190 

Gin Pro Gly He Asn He Gly Ala Trp Arg Val Arg Asn Ala Thr Ser 
195 200 205 

Trp Gin Arg Ser Ser Gin Leu Ser Gly Lys Trp Gin Ala Ala Tyr Thr 
210 215 220 

Tyr Ala Glu Arg Gly Leu Tyr Ser Leu Lys Ser Arg Leu Thr Leu Gly 
225 230 235 240 

Gin Lys Thr Ser Gin Gly Glu He Phe Asp Ser Val Pro Phe Thr Gly 

245 250 255 

Val Met Leu Ala Ser Asp Asp Asn Met Val Pro Tyr Ser Glu Arg Gin 
260 265 270 

Phe Ala Pro Val Val Arg Gly He Ala Arg Thr Gin Ala Arg Val Glu 
275 280 285 

Val Lys Gin Asn Gly Tyr Thr He Tyr Asn Thr Thr Val Ala Pro Gly 
290 295 300 

Pro Phe Ala Leu Arg Asp Leu Ser Val Thr Asp Ser Ser Gly Asp Leu 
305 310 315 320 

His Val Thr Val Trp Glu Ala Asp Gly Ser Thr Gin Met Phe Val Val 

325 330 335 

Pro Tyr Gin Thr Pro Ala He Ala Leu His Gin Gly Tyr Leu Lys Tyr 
340 345 350 

Ser Leu Leu Ala Gly Arg Tyr Arg Ser Ser Asp Ser Ala Thr Asp Lys 
3S5 360 365 

Arg Gin lie Ala Gin Ala Thr Leu Met Tyr Gly Leu Pro Trp Asn Leu 
370 375 380 

Thr Ala Tyr Gly Gly He Gin Ser Ala Thr His Asn Gin Ala Ala Leu 
385 390 395 400 

Leu Gly Leu Gly Gly Ser Leu Gly Arg Trp Gly Ser Leu Ser Val Asp 

405 410 415 

Gly Ser Asp Thr His Ser Gin Arg Gin Gly Glu Ala Val Gin Gin Gly 
420 425 430 

Ala Ser Trp Arg Leu Arg Tyr Ser Asn Gin Leu Thr Ala Thr Gly Thr 
435 440 445 

Asn Phe Phe Leu Thr Arg Trp Gin Tyr Ala Ser Gin Gly Tyr Asn Thr 
450 455 460 

Leu Ser Asp Val Leu Asp Ser Tyr Arg His Asn Gly Asn Arg Leu Trp 
465 " 470 475 480 
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Ser Trp Arg Glu 



Ser Gin Ser Trp 
500 

Arg Thr Asp Trp 
515 

Ser Trp Gly Thr 
530 

Gin Asn Arg Thr 
545 

Thr Ser Leu Trp 



Val Ser Ala Ser 
580 

Gin Gin Val Gly 
595 

Glu Val Arg Gin 
610 

Asn Ser Ala Leu 
625 

Gly Asp Tyr Ser 



Ala Gly Gly He 
660 

Leu Gin Gly Ser 
675 

Pro val Gly Gly 
690 

Thr Val Gly Asn 
70S 

Pro Ser Arg Leu 



Val Val Pro Thr 
740 

He Gly Ala Arg 
755 

He Pro Phe Gly 
770 

Ala Leu Val Asp 
785 



Asn Leu Gin Pro 
485 

Gly Arg His Leu 



Arg Asn Arg Pro 
S20 

Ser He Gly Gly 
535 

Leu Trp Arg Asn 
550 

Phe Ser Met Pro 
565 

Trp Gin Met Thr 



Val Asn Gly Glu 
600 

Ser Tyr Arg Ala 
615 

His Leu Ala Trp 
630 

Tyr Ser Arg Ala 
645 

Val He His His 



Val Ala Leu Val 
680 

Trp Pro Gly Val 
695 

Leu Asn Val Tyr 
710 

Pro Asp Asp Ala 
725 

Glu Gly Ala Val 



Ala Leu Met Thr 
760 

Ala Gin Val Thr 
775 

Thr Asp Ser Gin 
790 



Ser Ser Arg Thr 
490 

Gly Asn Leu Ser 
505 

Gly His Asp Asp 



Gly Ser Leu Ser 
540 

Gly Ala His Arg 
555 

Leu Ser Arg Trp 
570 

Ser Pro Ser His 
585 

Ala Phe Ser Gin 



Asp Ala Pro Pro 
620 

Asn Gly Asp Tyr 
635 

Met Arg Gin Met 
650 

His Gly Val Thr 
665 

Glu Ala Pro Gly 



Lys Thr Asp Phe 
700 

Gin Glu Asn Thr 
715 

Glu Val Thr Gin 
730 

Val Glu Ala Lys 
745 

Leu Lys Arg Glu 



Val Asn Gly Gin 
780 

Val Tyr Leu Thr 
795 



Thr Leu Met Leu 
49S 

Leu Thr Gly Ser 
510 

Ser Tyr Gly Leu 
525 

Leu Asn Trp Asn 



Lys Glu Asn He 
560 

Thr Gly Asn Asn 
S75 

Gly Gly Gin Thr 
590 

Gin Leu Asp Trp 
605 

Gly Gly Gly Asn 



Gly Leu Leu Gly 
640 

Gly Val Asn He 
655 

Leu Gly Gin Pro 
670 

Ala Ser Gly Val 
685 

Arg Gly Asp Thr 



Val Ser Leu Asp 
720 

Thr Asp Val Arg 
735 

Phe His Thr Arg 
750 

Asp Gly Ser Ala 
765 

Asp Gly Ser Ala 



Gly Leu Ala Asp 
800 
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Lys Gly Glu Leu Thr Val Lys Trp Gly Ala Gin Gin Cys Arg Val Asn 

805 810 815 

Tyr Arg Leu Pro Ala His Lys Gly lie Ala Gly Leu Tyr Gin Met Ser 
820 825 830 

Gly Leu Cys Arg 
835 



<210> 5 
<211> 156 
<212> PRT 

<213> Salmonella typhimurium 
<400> 5 

Met Trp Met Lys lie Gin Arg Val Lys Thr Val lie Tyr Ser Val Ser 
15 10 15 

Leu Leu Val Ala Ala Ser Ser Leu Val Pro lie Ala Asn Ala Ala Glu 
20 25 30 

Lys Leu Gin Thr Thr Leu Arg Val Gly Thr Tyr Phe Arg Ala Gly His 
35 40 45 

Val Pro Asp Gly Met Val Leu Ala Gin Gly Trp Val Thr Tyr His Gly 
50 55 60 

Ser His Ser Gly Phe Arg Val Trp Ser Asp Glu Gin Lys Ala Gly Asn 
65 70 75 80 

Thr Pro Thr Val Leu Leu Leu Ser Gly Gin Gin Asp Pro Arg His His 

85 90 95 

He Gin Val Arg Leu Glu Gly Glu Gly Trp Gin Pro Asp Thr Val Ser 
100 105 110 

Gly Arg Gly Ala He Leu Arg Thr Ala Ala Asp Asn Ala Ser Phe Ser 
115 120 125 

Val Val Val Asp Gly Asn Gin Glu Val Pro Ala Asp Thr Trp Thr Leu 
130 135 140 

Asp Phe Lys Ala Cys Ala Leu Ala Gin Glu Asp Thr 
145 150 155 
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SEQUENCE LISTING NO. 2 

<110> Folkesson, Anders 

<120> The complete sequence of the tcf insert of Salmonella 
enterica serovar Typhi . 

<130> The tcf insert in Salmonella typhi 

<140> 
<141> 



<160> 6 



<170> Patentln Ver. 2.1 



<210> 1 

<211> 9253 

<212> DNA 

<213> Salmonella typhi 



<220> 
<221> CDS 

<222> (1898) . . (2608) 

<223> tcfA putative fimbrial subunit 

<220> 
<221> CDS 

<222> (2659) . . (3234) 

<223> tcfB putative fimbrial subunit 

<220> 
<221> CDS 

<222> (3360) . . (6029) 

<223> tcfC putative fimbrial subunit 

<220> 
<221> CDS 

c222> (6052) . . (7131) 

<223> tcfD putative fimbrial subunit 

<220> 
<221> CDS 

<222> (7264) . . (7719) 

<223> tinR putative transcriptional regulator 
<400> 1 

tgtaagcatc ccgcataatc gagccattca catttagaga tcatccggca taaccaaccc 60 



gccaacgcag gagatcgctg tgcgtaaagc ccgtattact gcgcaccaga tcatcgctgt 120 
gattagatca gttgaatccg gacggactgt taaagatgtc taccgggagg ccggtatttc 180 
tgaagccacc agggacaact ggaagtctgg atacggcggc atggaagctt ccgatattaa 240 
atcttgagga tgtcaacgcc aggatttatg gtcgtttttc gctattttta atatccgctg 300 
tttgatcact tctgctgtcc gctttccgcc atttcatctt cactgattgg cgttgcgctc 360 
ttggtcagcg ccccgacttt gcgtttttcc ttcagtcggt aactttctcc tttgatattc 420 
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agtgtggttg agtgatgcag tagccgatcc aggatcgctg tcgccagcac gttatcaccg 480 

aacatctctc cccagtctgc gaagcctttg ttcgacgtca ggataatgct cgctttttca 540 

caccgccggc tcagcagtcg gaagaacaga ccggcttccc cactggccat tggcaggtaa 600 

cctatttcgt ccaggatcag cccccgcgca tagctcagtt gttgtagctg gcgctccagg 660 

cggttttcca gcttcgcttt catcagcgtc taccaacctg tcctgcggca tgaacaacac 720 

ccgacggccc acatctgccg ctttcacacc gggggcagcg gccaggtggg ttttctccac 780 

tccgggggcc ccagcaggat cacattctcg cagcgctcca cgaacgccag accggccaac 84 0 

tcccggacga ccctacgatc gatgcctggc tggaagctga agccgaaccg ctccagcgtt 900 

ttgactcacg gcagacgagc ctgtttcagc cgggattcca ttccgcgctg atgtctgccg 960 

ttctattcct gccgcagcgc catgcacagg aatgtcgact tattttgggc tgacagagcc 1020 

attggtcgtt aagacttgta tattcatgtt gcctacrcgc acaaaagggg gaaaggaacc 1080 

gcaaacaaca ctgacagcta agggaccggc accaatntga atggtatttt tcacaaca:: 1140 

atgttttaaa aggtgaattg tattttcatg gtgagttgtt ccgtttcaca ttgttgtttg 1200 

tggaatatct agctaacact caacctaaaa acagcgtacc acactgacca cactttgccg 1260 

gagaggtgta aacatcaagc gtaaccatat gacgcatata agtttttgtt tgctgatatc 1320 

atatttttaa cagatattga gcaagtaaat ccgacctaac tgtaacttta gttttctaaa 1380 

ttaattactt tgcttacttt tttttacctg gttttgcaaa acctcacgat gatcagtcta 1440 

tttgtggctt tgtttaaggt ttatttggtt ttgtagtctg aactgtatcc ccctcctgga 1500 

gggctttctt tagattctct gtctcctacg ttgttatgtt acgtatttgt tgctttgaag 1560 

gagggggaaa tacagtccca tttatctgag taagtcaggt acacagtaac aac::tcr:a 1620 

tgaagaattt ccaaaattt: tactgcggcg ttattaattg ttcagcgatt cctacagatc 1680 

tgtcgttcgc ttttggtgaa tgaaatccgt ggacttttat ttaccaattc tttctttcct 1740 

gaaaaaaaca gaggtattga gcgaaaaacc ttattccgta cgatgccctc cacacaaaat 1800 

gtattaacac tgaatcgtaa tttgcttctt tatgctgata accttctgtc tatgccaata 1860 

ctaaaatcca gatgactttt atacggtaaa atctggt atg aac ttt aaa gac ace 1915 

Met Asn Phe Lys Asp Thr 
1 5 

ctt ccc ggg gtg tct etc tgt gtc get atg ttt gca tgt ggt cat gec 1953 
Leu Pro Gly Val Phe Leu Cys Val Ala Met Phe Ala Cys Gly His Ala 
10 15 20 

agg gcg aat atg etc gtt tat ccc atg gcg gca gaa att aat agt age 2011 
Arg Ala Asn Met Leu Val Tyr Pro Met Ala Ala Glu lie Asn Ser Ser 
25 30 3S 
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cgc gaa gag gcc acc teg ctg ttc gec tat tec aaa tea gat cat gtg 2059 
Arg GIu GIu Ala Thr Ser Leu Phe Val Tyr Ser Lys Ser Asp His Val 
40 45 50 

caa tar act cga aca aga ate atg cgt att gaa cac cec ggt atg cca 2107 
Gin Tyr He Arg Thr Arg He Met Arg He GIu His Pro Gly Met Pro 
55 60 65 70 

cag gag aag gag gta cca gca ggg aat gat ata gag aca gga ctt gtt 2155 
Gin GIu Lys GIu Val Pro Ala Gly Asn Asp He GIu Thr Gly Leu Val 

75 80 85 

gtc tec ceg gag aaa ttt get ctt tec ccg gga aca aaa aaa aca ata 2203 
Val Ser Pro GIu Lys Phe Ala Leu Ser Pro Gly Thr Lys Lys Thr He 
90 95 100 

cgt gtt ate agt act cag gca ccg gaa aga gag gaa gcc tag egg gta 2251 
Arg Val He Ser Thr Gin Ala Pro GIu Arg GIu GIu Ala Trp Arg Val 
105 110 us 

tac ttc gag get gtt cct gaa ctg gaa gat gat cca cag gca ggc gga 2299 
Tyr Phe GIu Ala Val Pro GIu Leu GIu Asp Asp Pro Gin Ala Gly Gly 
120 125 " " 130 

aag caa aat tea tec gta agt gtg aat etc gcc tgg ggg gcg teg ctg 2347 
Lys Gin Asn Ser Ser Val Ser Val Asn Leu Val Trp Gly Val Leu Leu 
135 140 145 150 

cgt gtt tct ccg tea gac ccc agg cct gcg ctg gta acg gac ggt cac 2395 
Arg Val Ser Pro Ser Asp Pro Arg Pro Ala Leu Val Thr Asp Gly His 

155 ISO 165 

cac ctg ctg aat acg gga aac aca egg ctt tct ctt att egg get ggc 2443 
His Leu Leu Asn Thr Gly Asn Thr Arg Leu Ser Leu He Arg Ala Gly 
170 175 180 

aac tge gac acc aca tgc cac tgg cag aat ata ggc aaa agt att tac 2491 
Asn Cys Asp Thr Thr Cys His Trp Gin Asn He Gly Lys Ser He Tyr 
185 190 195 

ccc ggc ggg agt get gat att ccg gcc gga ata aaa agt aat gca ttt 2539 
Pro Gly Gly Ser Ala Asp He Pro Ala Gly lie Lys Ser Asn Ala Phe 
200 205 210 

cgt gtg gaa tat cgt acg ggt gca aat tea ccg gta ate tct get gat 2587 
Arg Val GIu Tyr Arg Thr Gly Ala Asn Ser Pro Val He Ser Ala Asp 
215 220 225 230 

tea aca gca gcc gga aag taa aaacacaegg agegtacget acaccccaca 263 8 

Leu Thr Ala Ala Gly Lys 

235 

Cttattctca gggggagegg atg tat acc gag tgt aca tat ate act gta ata 2691 

Met Tyr Thr GIu Cys Thr Tyr He Thr Val He 
240 245 

aac aac aaa gca agg tta ttt ttt atg aac atg aaa aca tct ttt att 2739 
Asn Asn Lys Ala Arg Leu Phe Phe Met Asn Met Lys Thr Ser Phe He 
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250 255 260 

gcc gca get gtg gca teg gec acc get tat tct ttt tct get tct gcg 
Ala Ala Ala Val Ala Leu Ala Thr Val Tyr Ser Phe Ser Val Ser Ala 
265 270 275 280 

gtt cag aag gat att acc gtc act gcc aat att gac agt aca ctt gaa 
Val Gin Lys Asp He Thr Val Thr Ala Asn He Asp Ser Thr Leu Glu 

285 290 295 

ctg ctg cag gcc gat ggt tea tec etc ccg teg act atg aag ctg gat 
Leu Leu Gin Ala Asp Gly Ser Ser Leu Pro Ser Thr Met Lys Leu Asp 
300 305 310 

ttc atg ccg ggt aag ggc ctg gtc cat aaa tea etc cag acc cgc ctt 
Phe Met Pro Gly Lys Gly Leu Val His Lys Ser Leu Gin Thr Arg Leu 
315 320 325 

tac age aac gat cag acc aag teg gtt aat gta aaa ctg ttg aat get 
Tyr Ser Asn Asp Gin Thr Lys Ser Val Asn Val Lys Leu Leu Asn Ala 
330 335 340 

cca caa ctt ate aac gtc ctg gat cec acc aaa ace att gat atg gaa 
Pro Gin Leu He Asn Val Leu Asp Pro Thr Lys Thr He Asp Met Glu 
345 350 355 360 

gtg act ctg gga gga egg tea ctg acc acc acc aat tct gta ctg gaa 
Val Thr Leu Gly Gly Arg Ser Leu Thr Thr Thr Asn Ser Val Leu Glu 

365 370 375 

get aaa acc ctg ttc ccg gac gca aaa act ggc gat get tea get ctg 
Ala Lys Thr Leu Phe Pro Asp Gly Lys Thr Gly Asp Ala Ser Ala Leu 
380 385 390 

ctg aac ctg gat att ggt cag aag get gga gca gcc tta caa aac ctg 
Leu Asn Leu Asp He Gly Gin Lys Ala Gly Ala Ala Leu Gin Asn Leu 
395 400 405 

cct gcc ggt gaa tac age gga ttg gtc agt ctg gtg att tea cag get 
Pro Ala Gly Glu Tyr Ser Gly Leu Val Ser Leu Val He Ser Gin Ala 
410 415 420 

gtc act gcc ggc taa taactggtta ttagctcttc atctgatccg gttttggggg 

Val Thr Ala Gly 

425 

gcaccgttcg tacctgaacc ggatccggta ttgatcttat tattcattgc aattcaggt 

tetttaegtg agtcgttatt tctgg atg tat tat tta ctg gga ttg tgc agt 

Met Tyr Tyr Leu Leu Gly Leu Cys Ser 
430 435 

ttt acc age cag gca act ctt att cec cct cct gga ttt gaa tct ctg 
Phe Thr Ser Gin Ala Thr Leu He Pro Pro Pro Gly Phe Glu Ser Leu 
440 445 450 

ctg gaa gga cag act gag caa att gaa gtg ttg eta cca ggg cat tea 
L u Glu Gly Gin Thr Glu Gin He Glu Val Leu Leu Pro Gly His Ser 
455 460 465 470 
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ccg gga cca tec ccg gcg gcg get aaa ccg gac acc gcg cag etc atg 3530 
Leu Gly Leu Phe Pro Val val Val Lys Pro Asp Thr Val Gin Phe Met 

475 480 485 

tec cca ccg acg gna ccc gaa age age ggg ctt gec gcg ccg ccg gec 3578 
Ser Pro Leu Met Val Leu Glu Ser Ser Gly Leu Ala Ala Leu Pro Ala 
490 49S 500 

gca gaa egg caa aaa gcg ctg get gca etc age cgt ccg ccg cca cgt 3626 
Ala Glu Arg Gin Lys Ala Leu Ala Ala Leu Ser Arg Pro Leu Leu Arg 
505 510 515 

aac age aat ctg gtc tgt ggt gtc tea gaa gca aaa gac age age gag 3674 
Asn Ser Asn Leu Val Cys Gly Val Ser Glu Ala Lys Asp Ser Ser Glu 
520 525 530 

tgt ggc tac gcg gca aca gat aaa gag gat get gcg gtt att cct gat 3722 
Cys Gly Tyr Val Ala Thr Asp Lys Glu Asp Val Ala Val He Phe Asp 
535 540 545 550 

gag aac aac get cag tea tct ttg ttt ctt aac egg gac egg teg ccg 3 770 
Glu Asn Asn Ala Gin Leu Ser Leu Phe Leu Asn Arg Asp Trp Leu Pro 

555 560 565 

gat gaa gaa cga cgt gac aaa cgc egg ccg acc ccg acc ccg gag ggc 3813 
Asp Glu Glu Arg Arg Asp Lys Arg Trp Leu Thr Pro Thr Pro Glu Gly 
570 575 580 

gtc age gca ccc act cac cgc cag acg ccg tat ctg age gac gac ccc 3866 
Val Ser Ala Phe He His Arg Gin Thr Leu Tyr Leu Ser Asp Asp Leu 
585 590 S95 

cac age cgt aat atg aca ctg aat ggt age ggt gec ctg ggg ctt ggt 3 914 
His Ser Arg Asn Met Thr Leu Asn Gly Ser Gly Ala Leu Gly Leu Gly 
600 605 610 

gac ggc cgc cac ccg gga ggc gac tag gcg get ate tgg aat cag cca 3 962 
Asp Gly Arg Tyr Leu Gly Gly Asp Trp Ala Ala He Trp Asn Gin Ser 
615 * 620 625 630 

gaa cac cac aat aac agt cag gec tgg ttt gac aat ctg ttt gtc cgt 4010 
Glu His Tyr Asn Asn Ser Gin Ala Trp Phe Asp Asn Leu Phe Val Arg 

635 640 645 

cag gac etc ggc aat cag tat tat ccc cag get ggt egg atg gac cag 4058 
Gin Asp Leu Gly Asn Gin Tyr Tyr Leu Gin Ala Gly Arg Met Asp Gin 
650 655 660 

egg aac ctg tec age gec acg ggg ggg gat ttt ggg tec agt ctg ctt 4106 
Arg Asn Leu Ser Ser Ala Thr Gly Gly Asp Phe Gly Phe Ser Leu Leu 
665 670 675 

ccc ctg age egg ttt gat gga tta cga acc ggg acc acc caa get tat 4154 
Pro Leu Ser Arg Phe Asp Gly Leu Arg Thr Gly Thr Thr Gin Ala Tyr 
630 685 690 



gtt aac cat gag gtg gac cat aat gee act ccg gtt acg get cag get 
Val Asn His Glu Val Asp His Asn Ala Thr Pro Val Mec Val Gin Val 
695 700 705 710 



4202 
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acc cga aat gcc cgt act gat att tat cgt ggc age gag ttg ctg ggg 4250 
Thr Arg Asn Ala Arg lie Asp He Tyr Arg Gly Ser Glu Leu Leu Gly 

715 720 725 

act cag etc ctg acc ccg gga atg cat acc ctg gat act cat tct ctt 4298 
Ser Gin Phe Leu Thr Pro Gly Met His Thr Leu Asp Thr His Ser Leu 
730 735 740 

cca ccg gga age tat cct ctg gcg ttg egg gtg tat gag gat ggg att 4346 
Pro Pro Gly Ser Tyr Pro Leu Ala Leu Arg Val Tyr Glu Asp Gly He 
745 750 755 

ctg egg cga acg gag acc cag ccc ttc agt aag ggg ggc aat age ttc 4394 
Leu Arg Arg Thr Glu Thr Gin Pro Phe Ser Lys Gly Gly Asn Ser Phe 
760 765 770 

agt gca cag acc cag tgg ttt att cag ggc ggg ctg gaa gat acc ggg 4442 
Ser Ala Gin Thr Gin Trp Phe He Gin Gly Gly Leu Glu Asp Thr Gly 
775 780 785 790 

gat aaa gcc age cat tat gac ggt gag act gtc atg get gcc gga ttc 4490 
Asp Lys Ala Ser His Tyr Asp Gly Glu Thr Val Met Ala Ala Gly Phe 

795 800 . 805 

caa act ggg ctg egg aaa aat ate agt ctg acc gaa ggt ate tct ctg 4538 
Gin Thr Gly Leu Arg Lys Asn He Ser Leu Thr Glu Gly He Ser Leu 
810 815 820 

gca cat gag gcc tgg tac agt gaa ace cga ctg aat tea cag cat gca 4586 
Ala His Glu Ala Trp Tyr Ser Glu Thr Arg Leu Asn Ser Gin His Ala 
82S 830 835 

gtg ctg gat ggc acg ctg gac ctt tct gcc ggg ata ctg cat ggg aca 4634 
Val Leu Asp Gly Thr Leu Asp Leu Ser Ala Gly lie Leu His Gly Thr 
840 845 850 

gac age acg age ggt aac act gag cag gtg aca tac aac gac gga ttt 4682 
Asp Ser Thr Ser Gly Asn Thr Glu Gin Val Thr Tyr Asn Asp Gly Phe 
855 860 865 870 

tec gcg agt ctg tgg cgt aac cat acg gaa agt gat gee tgt agt ggt 4730 
Ser Ala Ser Leu Trp Arg Asn His Thr Glu Ser Asp Ala Cys Ser Gly 

875 880 885 

cgt cat cca cag tea gtg cat gcc agt atg acc tgc cag act teg atg 4778 
Arg His Pro Gin Ser Val His Ala Ser Met Thr Cys Gin Thr Ser Met 
890 895 900 

aac gcc tec ctg teg gtt teg gtg ggg aac tgg tat gcc eta ctg gga 4826 
Asn Ala Ser Leu Ser Val Ser Val Gly Asn Trp Tyr Ala Leu Leu Gly 
905 910 915 

tac agt acc age agg aca gaa ggt egg ccg gtt tac egg gga tat gat 4 874 
Tyr Ser Thr Ser Arg Thr Glu Gly Arg Pro Val Tyr Arg Gly Tyr Asp 
920 925 930 

gat aac agt gac aaa gaa aat gtg ttc tgg cga cag gca tac ate cct 4 922 
Asp Asn Ser Asp Lys Glu Asn Val Phe Trp Arg Gin Ala Tyr lie Pro 
935 940 945 950 
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gcc tct cac cgc gaa tec gec cag gec age gca acg tac age ctt aat 4970 

Ala Ser His Arg Glu Ser Ala Gin Ala Ser Ala Thr Tyr Ser Leu Asn 

955 960 965 

atg get ggc atg aat att aat acc cat ggg gga gta tgg cga acc cga 5018 

Met Ala Gly Met Asn lie Asn Thr His Gly Gly Val Trp Arg Thr Arg 

970 975 980 



aat gac gga gtg aat gat gat ggc ctg ttt atg agt gtc agt gtg tea 
Asn Asp Gly Val Asn Asp Asp Gly Leu Phe Met Ser Val Ser Val Ser 
985 990 995 



5066 



tat gcc tct caa cca ccg aca atg act ggc agt aat agg tat acc tea 5114 
Tyr Ala Ser Gin Pro Pro Thr Met Thr Gly Ser Asn Arg Tyr Thr Ser 
1000 1005 1010 



gcc ggg acc gat att cac agt age egg aat caa aaa aca cag acg tec 5162 

Ala Gly Thr Asp lie His Ser Ser Arg Asn Gin Lys Thr Gin Thr Ser 

1015 1020 1025 1030 

egg aat gtg aac cat gtg aga tec egg cag cag gat erg tat cgt gaa 5210 

Trp Asn Val Asn His Val Arg Ser Trp Gin Gin Asp Leu Tyr Arg Glu 

1035 1040 1045 

ctg teg gtg ggt ttc tec ggt tat aac gac gac age tgg age ggg agt 5258 

Leu Ser Val Gly Phe Ser Gly Tyr Asn Asp Asp Ser Trp Ser Gly Ser 

1050 10S5 1060 

etc ggc gga cgc atg age ggc cgt atg ggt gaa ctg age gcc act ate 

Leu Gly Gly Arg Met Ser Gly Arg Met Gly Glu Leu Ser Ala Thr lie 
1065 1070 1075 

agt aac tec cat caa cgt aat gcg ggc age gcc agt tea etc ace get 

Ser Asn Ser His Gin Arg Asn Ala Gly Ser Ala Ser Ser Leu Thr Ala 
1080 1085 1090 

ggc cac age teg tct ctg gcg tea tec cgt aat gga ctg ttc tgg gga 

Gly Tyr Ser Ser Ser Leu Ala Leu Ser Arg Asn Gly Leu Phe Trp Gly 

1095 1100 1105 1110 

ggt ggt cag gac ggt gaa ccg gcc tct ggc atg gcg gtg aac gcg gag 

Gly Gly Gin Asp Gly Glu Pro Ala Ser Gly Met Ala Val Asn Val Glu 

1115 1120 112S 

tea gag ggg gac gag ggc agt age ggg aaa gta gtc age gtt cgt ggc 

Ser Glu Gly Asp Glu Gly Ser Ser Gly Lys Val Val Ser Val Arg Gly 

1130 1135 1140 

age age cag ccg ttc agt etc ggt ttt ggt cag cag teg ctg ctg ctg 5546 

Ser Ser Gin Pro Phe Ser Leu Gly Phe Gly Gin Gin Ser Leu Leu Leu 
1145 1150 1155 

atg gaa ggc tat aac gcc acg gag gtg acc att gag gat gca ggg gtt 5594 

Met Glu Gly Tyr Asn Ala Thr Glu Val Thr lie Glu Asp Ala Gly Val 
1160 1165 1170 

agt tea cag ggt atg gca ggc gta aaa gcg gga ggg gga age agg tgt 5642 

Ser Ser Gin Gly Met Ala Gly Val Lys Ala Gly Gly Gly Ser Arg Cys 



5306 



5354 



5402 



5450 



5498 
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1175 H80 1185 1190 

tac tec ctg aca ccc ggg cac ctg ctg gtt cac aac ate age gee age 5690 
Tyr Phe Leu Thr Pro Gly His Leu Leu Val His Asn He Ser Ala Ser 
1195 1200 1205 

acg age cga ctg cac gtt ggc cgc gta ctg gac aag gat ggc aga ccg 5738 
Met Ser Arg Leu Tyr Val Gly Arg Val Leu Asp Lys Asp Gly Arg Pro 
1210 1215 1220 

ctg ctg gac gca cag cca ctg aac tat cca ttt ttg teg teg gga cct 5786 
Leu Leu Asp Ala Gin Pro Leu Asn Tyr Pro Phe Leu Ser Leu Gly Pro 
1225 1230 1235 

tec ggg cga ttt age ctg cag age gag cat aaa gaa tec age ctg egg 5834 
Ser Gly Arg Phe Ser Leu Gin Ser Glu His Lys Glu Ser Ser Leu Trp 
1240 1245 1250 

ctg ctg tct aaa aac agg ate ctg cgt tgt ccg atg tea gta cat aaa 5882 
Leu Leu Ser Lys Asn Arg He Leu Arg Cys Pro Met Ser Val His Lys 
1255 1260 1265 1270 

cgt egg gat gtt atg cag gta gtg ggt gat gtg egg tgt gaa tta agt 5930 
Arg Arg Asp Val Met Gin Val Val Gly Asp Val Arg Cys Glu Leu Ser 
1275 1280. 1285 

gac gtg gat gee ctg cca cag gcg ttg caa ata teg ccg egg gtc ate 
Asp Val Asp Ala Leu Pro Gin Ala Leu Gin He Ser Pro Arg Val He 
1290 1295 1300 

cgt ttg ctg aac gtg gca ggt ttg ctg cgc cat tec gtt cag gaa gec 
Arg Leu Leu Asn Val Ala Gly Leu Leu Arg His Ser Val Gin Glu Ala 
1305 1310 1315 

tga cgtagagata aaggcgttaa ct atg agt aat aaa atg aag tgg acg agt 6078 

Met Ser Asn Lys Met Lys Trp Thr Ser 
1320 1325 



5978 



6026 



atg aca gec cat tgg tea gca att att aat ttc ate cga aaa tat gtt 
Met Thr Ala His Trp Ser Ala He lie Asn Phe He Arg Lys Tyr Val 
1330 1335 1340 



6126 



6270 



tat cca gca agg ata att gec ate ctg ctg atg get ggc get aca ctg 6174 
Tyr Pro Ala Arg He He Ala He Leu Leu Met Ala Gly Ala Thr Leu 
134S 1350 1355 1360 

cca caa gtc gee gat gcg att ace gtc gac ctg aat tac gac aag aac 6222 
Pro Gin Val Ala Asp Ala He Thr Val Asp Leu Asn Tyr Asp Lys Asn 
1365 1370 1375 

aat gta gcg gtc ate act cct gtc tgg tec caa gaa tgg agt gta gca 
Asn Val Ala Val He Thr Pro Val Trp Ser Gin Glu Trp Ser Val Ala 
1380 1385 1390 

aat gtg ttg ggg gga tgg gta tgt cgt tea aac agg aat gaa aat gag 6318 
Asn Val Leu Gly Gly Trp Val Cys Arg Ser Asn Arg Asn Glu Asn Glu 
1395 1400 1405 

ggg gcg tgt gaa gaa aca cat ttg gta tgg tgg tat get ttt gga get 6366 
Gly Ala Cys Glu Glu Thr His Leu Val Trp Trp Tyr Ala Phe Gly Ala 
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1410 1415 1420 

tat tea aaa att cgt ctg cgt ttc aga gaa caa. ate age eat gee gaa 6414 
Tyr Ser Lys lie Arg Leu Arg Phe Arg Glu Gin lie Ser His Ala Glu 
1425 1430 1435 1440 

att acg etc ata ctg etc ggc agt gtt cgt gat gec tgt tat act ggt 5462 
lie Thr Leu lie Leu Leu Gly Ser Val Arg Asp Ala Cys Tyr Thr Gly 
1445 1450 1455 

gtc ate aac atg aac get get gca tgt caa tgg ggt agg teg ctg aaa 6510 
Val lie Asn Met Asn Ala Ala Ala Cys Gin Trp Gly Arg Ser Leu Lys 
1460 1465 1470 

ctt agg ata cct tea gaa gag ctt gcg aag ata cct aca age gga aca 6558 
Leu Arg He Pro Ser Glu Glu Leu Ala Lys He Pro Thr Ser Gly Thr 
1475 1480 1485 

tgg aaa gca acg tta gtc ctg gat tat tta caa tgg ggc gga gac gat 6606 
Trp Lys Ala Thr Leu Val Leu Asp Tyr Leu Gin Trp Gly Gly Asp Asp 
1490 1495 1500 



cct tta ggc aca tea act aca gat ate acg ctg aat gta aca gac cac 6654 
Pro Leu Gly Thr Ser Thr Thr Asp He Thr Leu Asn Val Thr Asp His 
1505 1510 1515 1520 



ttt get gaa aat gcg get att tac ttt ccg caa ttt ggt aca gca acg 
Phe Ala Glu Asn Ala Ala He Tyr Phe Pro Gin Phe Gly Thr Ala Thr 
1525 1530 1535 



ggc agg get aat ctg gat atg tgt ctg tat gac gga ggt gtg aaa gee 

Gly Arg Ala Asn Leu Asp Met Cys Leu Tyr Asp Gly Gly Val Lys Ala 

1555 1560 1565 

cgt tea tta cag atg aag ata gaa gga age aat aag tea ggt acg gga 

Arg Ser Leu Gin Met Lys He Glu Gly Ser Asn Lys Ser Gly Thr Gly 

1570 1575 1580 

ttt cag gtt ata aag age gat tct get gat acg att gat tat gcg gtc 

Phe Gin Val He Lys Ser Asp Ser Ala Asp Thr He Asp Tyr Ala Val 

1585 1590 1595 1600 



ttc agt ctg gat aac gtg gat aaa gca gca acg cgt ccg gtg gta ctt 

Phe Ser Leu Asp Asn Val Asp Lys Ala Ala Thr Arg Pro Val Val Leu 
1620 1625 1630 

ccc ggg caa egg cag gcg gta cgt cgt gtg cca gtg ccc ctt ace ctg 

Pro Gly Gin Arg Gin Ala Val Arg Cys Val Pro val Pro Leu Thr Leu 
1635 1640 1645 



6702 



ccc egg gtg gac ctg aat ctt cac egg atg aat gee tea caa atg teg 6750 
Pro Arg Val Asp Leu Asn Leu His Arg Met Asn Ala Ser Gin Met Ser 
1540 1545 1550 



6798 



6846 



6894 



agt atg aat tat ggg gga cga agt att cct gtc ace cgt ggc gtg gag 6942 
Ser Met Asn Tyr Gly Gly Arg Ser He Pro Val Thr Arg Gly Val Glu 
1605 1610 1615 



6990 



7038 
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aca aca caa ccc ttt aac ate aga gag aag cgt tec ggt gag tat cag 7086 
Thr Thr Gin Pro Phe Asn lie Arg Glu Lys Arg Ser Gly Glu Tyr Gin 
1650 1655 1660 

gga acg ctg aca gtg aca atg ctg atg gga aca caa acc ccc tga 7131 
Gly Thr Leu Thr Val Thr Met Leu Met Gly Thr Gin Thr Pro 
1665 1670 1675 

cagtaattat ttattttatt gatatctttc ttatatggtc ttttaaatca gagctctctt 7191 

tatatacttg ttttattcaa taaagagaat ctattcactt atgaaaatca atgcgtgagg 7251 

ttctgettte cc atg act gtg tat tta gat gat aaa gat aaa gaa tta ttg 7302 
Met Thr Val Tyr Leu Asp Asp Lys Asp Lys Glu Leu Leu 
1680 1685 1690 

aaa gaa ate caa aaa gat tgt gca caa act tta tgg caa ctt gca tat 73 50 
Lys Glu He Gin Lys Asp Cys Ala Gin Thr Leu Trp Gin Leu Ala Tyr 
1695 1700 1705 

aaa gtg gga ctt acg ccc aca cca tgt etc aaa cgt tta aaa aaa ctt 7398 
Lys Val Gly Leu Thr Pro Thr Pro Cys Phe Lys Arg Leu Lys Lys Leu 
1710 1715 1720 

aaa gac agg ggg gtt ate att cgt cag ttc get tta ttg gat aag gaa 7446 
Lys Asp Arg Gly Val He He Gly Gin Phe Ala Leu Leu Asp Lys Glu 
1725 1730 1735 1740 

aaa eta ggt ctt tea ctt aat gtc ttt att atg att aac ata tct gag 7494 
Lys Leu Gly Leu Ser Leu Asn Val Phe He Met He Asn lie Ser Glu 
1745 1750 1755 

gag caa tac get agt att tct gag aaa ata aag tea atg cct gag gtt 7S42 
Glu Gin Tyr Ala Ser lie Ser Glu Lys He Lys Ser Met Pro Glu Val 
1760 1765 1770 

att gee ttt tat cga att tct gga tea ttt aat tat tta atg cat aca 7590 
He Ala Phe Tyr Arg He Ser Gly Ser Phe Asn Tyr Leu Met His Thr 
1775 1780 1785 

gta ttt aca gat atg aac gat tac tat agt ttt tat gag aaa ata ata 7638 
Val Phe Thr Asp Met Asn Asp Tyr Tyr Ser Phe Tyr Glu Lys He lie 
1790 1795 1800 

tta act aat tct tea att agt gga tct gca teg age ttt gtt ctt gag 7686 
Leu Thr Asn Ser Ser He Ser Gly Ser Ala Ser Ser Phe Val Leu Glu 
1805 1810 1815 1820 

caa ata aag gaa aca aac gaa ctg tea gtg tga aagtgtgatg tgtacttact 773 9 
Gin He Lys Glu Thr Asn Glu Leu Ser Val 
1825 1830 

gatttaatac attattatcc ttcttacgga acaacaaegg cagattgegg ctgttgaaca 7799 

aggattttaa tcagcagtgg tgaaattaag eggcacagaa taacacagcg gaatatcaca 7859 

tggttaaata tcaccccgtg catgtaacaa aaaacegcat taaaacagat gatgttactg 7919 

atatttattt cgttgaaccc ttctggaaaa aaggcgaaaa ccacataatt gagtcattga 7979 
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tgttctttga agagttacaa aagtcattta atttattcaa ccataaatat gggttaaata 8039 
aatatatact caggatcccc tgggaatttg tgctcataca catggaaagg atcagtaaat 8099 
taaatagcgt cgggttattt gctgtttctg ttgactttaa taacaaccac aaactcccga 8159 
gcgagcacac caggagtcgc agagattatg gtacggaagt ttggtttgat ttttgtggta 8219 
aacactccca ttccagtgaa attaaaaacc ttggattctt ttttcaggct tgcgtagtgc 8279 
ctcgtgatcc caattttatc agtagtgttt accattatca taagttccaa aagattctcg 8339 
tcggggacac aaacgacgta gaacagaggg ccgtgtacca gaacgaagtt gantacatgt 8399 
atggaatgca atggccatcg tcatatgacg gttcttcctt tcgggatcat aaaaaaaatg 8459 
aaactcggcg tatataacag aaggagtgaa aatttgaatc aaaaatatct tatttatttt 8519 
ttgtttaatt attgttttgt tttttattac gatcaaacac aaagaacatc attgttcgtg 8579 
cggtgcggag gctggaagtt taggggatga ccgtttatca acaactttac cacagccacc 863 9 
atacgaatgg tttatatatg cactagatgt attattttag cccaacacac cgatggttgc 8699 
tacttgcatt gatgatgttc cgccacacta aggaatatac atctgtatct cgttatacgc 8759 
acactcacat tactaatcat tattaatatg agtgtggttc ttgttttacg catgcatggt 8819 
tgcatgtgac gttaaattta aatgagctga ccgtatgaat tctaaatact ttagagaggc 8879 
gttttttgtc tcggtagttg ttatattatt attttatttg gtgttattcg cagccagtgc 8939 
tcatgctgaa ggcggtttca gatctggagg cattgggtta tttatgacgg gaacaagaga 8 999 
gatgccactg tagagataat aaatcctgct aaagattccc caattcttgt gcattgacat 9059 
cctccacgtc ccgaagggcg tgggttcctg ctccaacggg ctgcctgact gcacgcccct 9119 
tccacaggca agcacggcgt gtcccgctct aaaatgttac gcgcgccgtt tacatcggcg 9179 
ttcgcagtat atcttcatac cagacacttg taagtatctc gcataaccgt gccattcaca 923 9 
tttagagatc atac 9253 

<210> 2 
<211> 236 
<212> PRT 

<213> Salmonella typhi 
<400> 2 

Met Asn Phe Lys Asp Thr Leu Pro Gly Val Phe Leu Cys Val Ala Met 
15 10 15 

Phe Ala Cys Gly His Ala Arg Ala Asn Met Leu Val Tyr Pro Met Ala 
20 25 30 

Ala Glu lie Asn Ser Ser Arg Glu Glu Ala Thr Ser Leu Phe Val Tyr 
35 40 45 
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Ser Lys Ser Asp His Val Gin Tyr He Arg Thr Arg He Met Arg He 
SO 55 60 

Glu His Pro Gly Met Pro Gin Glu Lys Glu Val Pro Ala Gly Asn Asp 
65 *?0 75 80 

He Glu Thr Gly Leu Val Val Ser Pro Glu Lys Phe Ala Leu Ser Pro 

85 90 95 

Gly Thr Lys Lys Thr He Arg Val lie Ser Thr Gin Ala Pro Glu Arg 
100 105 110 

Glu Glu Ala Trp Arg Val Tyr Phe Glu Ala Val Pro Glu Leu Glu Asp 
115 120 125 

Asp Pro Gin Ala Gly Gly Lys Gin Asn Ser Ser Val Ser Val Asn Leu 
130 135 140 

Val Trp Gly Val Leu Leu Arg Val Ser Pro Ser Asp Pro Arg Pro Ala 
145 150 155 160 

Leu Val Thr Asp Gly His His Leu Leu Asn Thr Gly Asn Thr Arg Leu 

165 170 175 

Ser Leu He Arg Ala Gly Asn Cys Asp Thr Thr Cys His Trp Gin Asn 
180 185 190 

He Gly Lys Ser He Tyr Pro Gly Gly Ser Ala Asp He Pro Ala Gly 
195 200 205 

He Lys Ser Asn Ala Phe Arg Val Glu Tyr Arg Thr Gly Ala Asn Ser 
210 215 220 

Pro Val He Ser Ala Asp Leu Thr Ala Ala Gly Lys 
225 230 235 



<210> 3 
<211> 191 
<212> PRT 

<213> Salmonella typhi 
<400> 3 

Met Tyr Thr Glu Cys Thr Tyr He Thr Val He Asn Asn Lys Ala Arg 
15 10 15 

Leu Phe Phe Met Asn Met Lys Thr Ser Phe He Ala Ala Ala Val Ala 
20 25 30 

Leu Ala Thr Val Tyr Ser Phe Ser Val Ser Ala Val Gin Lys Asp He 
35 40 45 

Thr Val Thr Ala Asn lie Asp Ser Thr Leu Glu Leu Leu Gin Ala Asp 
50 55 60 

Gly Ser Ser Leu Pro Ser Thr Met Lys Leu Asp Phe Met Pro Gly Lys 
65 70 75 80 

Gly Leu Val His Lys Ser Leu Gin Thr Arg Leu Tyr Ser Asn Asp Gin 

85 90 95 
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Thr Lys Ser Val Asn Val 
100 

Val Leu Asp Pro Thr Lys 
115 

Arg Ser Leu Thr Thr Thr 
130 

Pro Asp Gly Lys Thr Gly 
145 ISO 

Gly Gin Lys Ala Gly Ala 
165 

Ser Gly Leu Val Ser Leu 
180 



Lys Leu Leu Asn Ala 
105 

Thr lie Asp Met Glu 
120 

Asn Ser Val Leu Glu 
135 

Asp Ala Ser Ala Leu 

155 

Ala Leu Gin Asn Leu 
170 

Val lie Ser Gin Ala 
18S 



Pro Gin Leu lie Asn 
110 

Val Thr Leu Gly Gly 
125 

Ala Lys Thr Leu Phe 
140 

Leu Asn Leu Asp lie 

160 

Pro Ala Gly Glu Tyr 
175 

Val Thr Ala Gly 
190 



<210> 4 
<211> 889 
<212> PRT 

<213> Salmonella typhi 



<400> 4 

Met Tyr Tyr Leu Leu Gly Leu Cys 
1 S 

He Pro Pro Pro Gly Phe Glu Ser 
20 

He Glu Val Leu Leu Pro Gly His 
35 40 



Ser Phe Thr Ser Gin Ala Thr Leu 
10 15 

Leu Leu Glu Gly Gin Thr Glu Gin 
25 30 

Ser Leu Gly Leu Phe Pro Val Val 

45 



Val Lys Pro Asp Thr Val Gin Phe Met Ser Pro Leu Met Val Leu Glu 
SO 55 60 

Ser Ser Gly Leu Ala Ala Leu Pro Ala Ala Glu Arg Gin Lys Ala Leu 
65 70 75 80 

Ala Ala Leu Ser Arg Pro Leu Leu Arg Asn Ser Asn Leu Val Cys Gly 

85 90 95 

Val Ser Glu Ala Lys Asp Ser Ser Glu Cys Gly Tyr Val Ala Thr Asp 
100 105 110 

Lys Glu Asp Val Ala Val He Phe Asp Glu Asn Asn Ala Gin Leu Ser 
US 120 125 

Leu Phe Leu Asn Arg Asp Trp Leu Pro Asp Glu Glu Arg Arg Asp Lys 
130 135 140 

Arg Trp Leu Thr Pro Thr Pro Glu Gly Val Ser Ala Phe He His Arg 
145 150 155 160 

Gin Thr Leu Tyr Leu Ser Asp Asp Leu His Ser Arg Asn Met Thr Leu 
165 170 175 
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Asn Gly Ser Gly 
180 

Asp Trp Ala Ala 
195 

Ala Trp Phe Asp 
210 

Tyr Leu Gin Ala 
225 

Gly Gly Asp Phe 



Leu Arg Thr Gly 
260 

Asn Ala Thr Pro 
275 

He Tyr Arg Gly 
290 

Met His Thr Leu 
305 

Ala Leu Arg Val 



Pro Phe Ser Lys 
340 

He Gin Gly Gly 
355 

Gly Glu Thr Val 
370 

He Ser Leu Thr 
385 

Glu Thr Arg Leu 



Leu Ser Ala Gly 
420 

Glu Gin val Thr 
435 

His Thr Glu Ser 
450 

Ala Ser Met Thr 
465 

Val Gly Asn Trp 



Ala Leu Gly Leu 



He Trp Asn Gin 
200 

Asn Leu Phe Val 
215 

Gly Arg Met Asp 
230 

Gly Phe Ser Leu 
24S 

Thr Thr Gin Ala 



Val Met Val Gin 
280 

Ser Glu Leu Leu 
295 

Asp Thr His Ser 
310 

Tyr Glu Asp Gly 
325 

Gly Gly Asn Ser 



Leu Glu Asp Thr 
360 

Met Ala Ala Gly 
375 

Glu Gly He Ser 
390 

Asn Ser Gin His 
405 

He Leu His Gly 



Tyr Asn Asp Gly 
440 

Asp Ala Cys Ser 
455 

Cys Gin Thr Ser 
470 

Tyr Ala Leu Leu 
485 



Gly Asp Gly Arg 
185 

Ser Glu His Tyr 



Arg Gin Asp Leu 
220 

Gin Arg Asn Leu 
235 

Leu Pro Leu Ser 
250 

Tyr Val Asn His 
265 

Val Thr Arg Asn 



Gly Ser Gin Phe 
300 

Leu Pro Pro Gly 
315 

He Leu Arg Arg 
330 

Phe Ser Ala Gin 
345 

Gly Asp Lys Ala 



Phe Gin Thr Gly 
380 

Leu Ala His Glu 
395 

Ala Val Leu Asp 
410 

Thr Asp Ser Thr 
425 

Phe Ser Ala Ser 



Gly Arg His Pro 
460 

Met Asn Ala Ser 
475 

Gly Tyr Ser Thr 
490 



Tyr Leu Gly Gly 
190 

Asn Asn Ser Gin 
205 

Gly Asn Gin Tyr 



Ser Ser Ala Thr 
240 

Arg Phe Asp Gly 
255 

Glu Val Asp His 
270 

Ala Arg He Asp 
285 

Leu Thr Pro Gly 



Ser Tyr Pro Leu 
320 

Thr Glu Thr Gin 
335 

Thr Gin Trp Phe 
350 

Ser His Tyr Asp 
365 

Leu Arg Lys Asn 



Ala Trp Tyr Ser 
400 

Gly Thr Leu Asp 
415 

Ser Gly Asn Thr 
430 

Leu Trp Arg Asn 
445 

Gin Ser Val His 



Leu Ser Val Ser 
480 

Ser Arg Thr Glu 
495 
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Gly Arg Pro Val Tyr Arg Gly Tyr Asp Asp Asn Ser Asp Lys Glu Asn 
500 505 510 

Val Phe Trp Arg Gin Ala Tyr He Pro Ala Ser His Arg Glu Ser Ala 
515 520 525 

Gin Ala Ser Ala Thr Tyr Ser Leu Asn Met Ala Gly Met Asn lie Asn 
530 535 S40 

Thr His Gly Gly Val Trp Arg Thr Arg Asn Asp Gly Val Asn Asp Asp 
545 550 555 560 

Gly Leu Phe Met Ser Val Ser Val Ser Tyr Ala Ser Gin Pro Pro Thr 
565 570 575 

Met Thr Gly Ser Asn Arg Tyr Thr Ser Ala Gly Thr Asp He His Ser 
580 585 S90 

Ser Arg Asn Gin Lys Thr Gin Thr Ser Trp Asn Val Asn His Val Arg 
595 €00 605 

Ser Trp Gin Gin Asp Leu Tyr Arg Glu Leu Ser Val Gly Phe Ser Gly 
610 615 620 

Tyr Asn Asp Asp Ser Trp Ser Gly Ser Leu Gly Gly Arg Met Ser Gly 
625 630 635 640 

Arg Met Gly Glu Leu Ser Ala Thr He Ser Asn Ser His Gin Arg Asn 
645 650 655 

Ala Gly Ser Ala Ser Ser Leu Thr Ala Gly Tyr Ser Ser Ser Leu Ala 
660 665 670 

Leu Ser Arg Asn Gly Leu Phe Trp Gly Gly Gly Gin Asp Gly Glu Pro 
675 680 685 

Ala Ser Gly Met Ala Val Asn Val Glu Ser Glu Gly Asp Glu Gly Ser 
690 695 700 

Ser Gly Lys Val Val Ser Val Arg Gly Ser Ser Gin Pro Phe Ser Leu 
705 710 715 720 

Gly Phe Gly Gin Gin Ser Leu Leu Leu Met Glu Gly Tyr Asn Ala Thr 
725 730 735 

Glu Val Thr He Glu Asp Ala Gly Val Ser Ser Gin Gly Met Ala Gly 
740 745 750 

Val Lys Ala Gly Gly Gly Ser Arg Cys Tyr Phe Leu Thr Pro Gly His 
75S 760 765 

Leu Leu Val His Asn He Ser Ala Ser Met Ser Arg Leu Tyr Val Gly 
770 775 780 

Arg Val Leu Asp Lys Asp Gly Arg Pro Leu Leu Asp Ala Gin Pro Leu 
785 790 795 800 

Asn Tyr Pro Phe Leu Ser Leu Gly Pro Ser Gly Arg Phe Ser Leu Gin 
805 810 815 



Substitute sheet (Rule 26) 



WO 00/73336 



PCT/SE00/01079 



Ser Glu.His Lys Glu Ser Ser Leu 
820 

Leu Arg Cys Pro Met Ser Val His 
835 840 

Val Gly Asp Val Arg Cys Glu Leu 
850 855 

Ala Leu Gin lie Ser Pro Arg Val 
865 870 

Leu Leu Arg His Ser Val Gin Glu 
885 



51 

Trp Leu Leu Ser Lys Asn Arg He 
825 830 

Lys Arg Arg Asp Val Met Gin Val 

845 

Ser Asp Val Asp Ala Leu Pro Gin 
860 

lie Arg Leu Leu Asn Val Ala Gly 
875 B60 

Ala 



<210> 5 

<211> 359 

<212> PRT 

<213> Salmonella typhi 



<400> 5 

Met Ser Asn Lys Met Lys Trp Thr Ser Met Thr Ala His Trp Ser Ala 
1 5 10 15 

He He Asn Phe He Arg Lys Tyr val Tyr Pro Ala Arg He He Ala 
20 25 30 



He Leu Leu Met Ala Gly Ala Thr Leu Pro Gin Val Ala Asp Ala He 
35 40 45 

Thr Val Asp Leu Asn Tyr Asp Lys Asn Asn Val Ala Val He Thr Pro 
50 55 60 

Val Trp Ser Gin Glu Trp Ser Val Ala Asn Val Leu Gly Gly Trp Val 
65 70 75 80 

Cys Arg Ser Asn Arg Asn Glu Asn Glu Gly Ala Cys Glu Glu Thr His 

85 90 95 



Leu Val Trp Trp Tyr Ala Phe Gly Ala Tyr Ser Lys He Arg Leu Arg 
100 105 HO 

Phe Arg Glu Gin He Ser His Ala Glu He Thr Leu He Leu Leu Gly 
115 120 125 

Ser val Arg Asp Ala Cys Tyr Thr Gly Val He Asn Met Asn Ala Ala 
130 135 140 



Ala Cys Gin Trp 
145 

Leu Ala Lys He 



Gly Arg Ser Leu 
150 

Pro Thr Ser Gly 
165 



Lys Leu Arg He 
155 

Thr Trp Lys Ala 

170 



Pro Ser Glu Glu 
160 

Thr Leu Val Leu 
175 



Asp Tyr Leu Gin Trp Gly Gly Asp 
180 

Asp He Thr Leu Asn Val Thr Asp 

195 200 



Asp Pro Leu Gly Thr Ser Thr Thr 
185 190 

His Phe Ala Glu Asn Ala Ala He 

205 
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Tyr Phe Pro Gin Phe Gly Thr Ala Thr Pro Arg Val Asp Leu Asn Leu 
210 215 220 

His Arg Met Asn Ala Ser Gin Met Ser Gly Arg Ala Asn Leu Asp Met 
225 230 235 240 

Cys Leu Tyr Asp Gly Gly Val Lys Ala Arg Ser Leu Gin Met Lys lie 
245 250 255 

Glu Gly Ser Asn Lys Ser Gly Thr Gly Phe Gin Val lie Lys Ser Asp 
260 265 270 

Ser Ala Asp Thr He Asp Tyr Ala Val Ser Met Asn Tyr Gly Gly Arg 
275 280 285 

Ser lie Pro Val Thr Arg Gly Val Glu Phe Ser Leu Asp Asn Val Asp 
290 295 300 

Lys Ala Ala Thr Arg Pro Val Val Leu Pro Gly Gin Arg Gin Ala Val 
305 310 315 320 

Arg Cys Val Pro Val Pro Leu Thr Leu Thr Thr Gin Pro Phe Asn He 
325 330 335 



Arg Glu Lys Arg Ser Gly Glu Tyr Gin Gly Thr Leu Thr Val Thr Met 
340 345 350 



Leu Met Gly Thr Gin Thr Pro 
355 



<210> 6 
<211> 151 
<212> PRT 

<213> Salmonella typhi 
<400> 6 

Met Thr Val Tyr Leu Asp Asp Lys Asp Lys Glu Leu Leu Lys Glu He 
15 10 15 

Gin Lys Asp Cys Ala Gin Thr Leu Trp Gin Leu Ala Tyr Lys Val Gly 
20 25 30 

Leu Thr Pro Thr Pro Cys Phe Lys Arg Leu Lys Lys Leu Lys Asp Arg 
35 40 45 



Gly Val He He Gly Gin Phe Ala 
50 55 

Leu Ser Leu Asn Val Phe He Met 
65 70 

Ala Ser He Ser Glu Lys He Lys 

85 

Tyr Arg lie Ser Gly Ser Phe Asn 
100 



Leu Leu Asp Lys Glu Lys Leu Gly 
60 

lie Asn He Ser Glu Glu Gin Tyr 
75 80 

Ser Met Pro Glu Val He Ala Phe 
90 95 

Tyr Leu Met His Thr Val Phe Thr 
105 110 
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Asp Met Asn Asp Tyr Tyr Ser Phe Tyr Glu Lys lie lie Leu Thr Asn 
11S 120 12S 

Ser Ser He Ser Gly Ser Ala Ser Ser Phe Val Leu Glu Gin He Lys 
UO 135 140 

Glu Thr Asn Glu Leu Ser Val 
145 150 
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